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COMPOUNDS AND METHODS FOR IMMUNOTHERAPY 
AND DIAGNOSIS OF TUBERCULOSIS 

CROSS-REFERENCE TO REFATED .APPLICATIONS 

Tnis application is a continuation-in-part of U.S. Application 
No. 9/025.197. filed February ,8. 1998; which is a continuation-m-pan of 
L-.S. Application No. 08/942.578. tiled October 1. 1997, which is a continuation, n-pan 
of U.S. Application No 08, SIX. I 12. filed March 13. 1997; which is a co.mnuauon-.n- 

- . - — . ^-'uw,^ , , wiiicii v^tiiii.s pnoniv 



:Vom PCT Application No. -CTVS ^ 14674. riled Aimust N; i d % 
continuauon-m-part of U.s. Application No. 08/680.574. tiled July !?. ]00 6 ; which i< , 
continuation-in-part of U.S. Application No. 08/(559.683. Hied June 5. 19%; which is a 
continuation-in-part of U S. Application No. 08,620.874. filed March 22. mo,, novv 
Uband0ncd: wh,ch ls 1 ^nunuation-in-pan of U.S. Application No 08/533.63-U :lled 
• SL * p,embCr2: - ! " 5 - ™ abandoned: which ls a continuation- ,n-par: o: 
U.S. Application No 0 X .5:;,^„. riled September 1 . u>9 5 . now abandoned. 

rF.CHNICAL FIELD 

The present invention relate, -neruiiv ;„ aetectinR. :reati,n; ana 
preventing Shcobaacnun, :uh,-rcuios,s injection. The invention >s more pumeuiariv 
-■iated to polypeptides comnn.smu a \>vcn» m <m »<hrrcuios,s am,,en. « r a portion 

;\<r. r-:-\:u vr aa<;no;a:L: ana aaa;;;a:;;h; 
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" " *~ «** '"" SS - — • <" - • npmdiKtnv 

coush. If ,c„ unrated. OT „ complications and dead, rypicallv resul , 

Although tuberculosa can general,, be controlled us,„ c extended 
T " toPy ' S " Ch " »* » P-en, the spread of *e dtsea* 

aJ-hough c„mp„ Mce ^ the mm reg , mcn fc mucA ^ bchay]or B • 

.o So mc pa U e„,s Jo „ comp ,„ e „ e ^ Qf whidi ^ ]ead 

.nenect.ve treatment and the development of drag resistance 

^ lnh,b„,n 5 the spr ead of tuberculosa recces effeem, vaccnatton and 

' J,a? "" S,S °' * to O — * vaccnanon »„„ „ bact.na , •„- 

™ s: ,„c«.,t « ,„ : „ ducmg DrMcciivo _ m(Ki ^ 

NKcooactenaro etnp,oved f„, , nK purposc ls w/ „ c,,^,^ 

™,en, su™ of M^,™ „„ m H„„eve, the safetv »d eflicaev of BCO is a 
-urce a, control, an, s „ mc ^ ^ . ^ ^ ^ ^ 

- : "" al PUbi ' C U " S ""' S :S — * ™ed -» ? a s k ,„ test. .„ ch lnvolvc , 

CXP '"" re '° ' Ub " iUi '" Pm J ™«> A„„oe„- sp ee„lc 

' rCSDn " S " reSm ' " — « ! " d ™™ » *e ,„,„„„„ S1 ,e „, „„„„ 

~ Wh,t " :M ' UKS — " ,e,.,„,„ v ^d 

'7:;;: cvcr - j ■- ■•»* « « ' 

■■Mth .« cannot 0eo, 5 „„ga,s„eo „ 0 „, !mmiuilall 

Wbic TOCr „„ na?K ha „ sii<nm ;o jk ^ iK ^ ^ 

_a ir L .^Liommam imiuctrrs ^w' such immunay 
"^ : r " I! : ' 11C " : ' ::1 " n 'tectum auamst l: .'/^-^p,;, . nIec „ ()T1 

^ iW - ■ -tit-r. t s. aur \, depletion oi 

i " S ° C:aiC ' J ' vV,th :iuinan ^nniunodciicicricy v !n]S . H J\'. :n recrioi; 

^amma-mtcrrcron -IFVv. which. in - U n his .u 
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"""" """^ ■* l '--^iMroxv-v, m „, n D; SlmiIall , 1L . p has 
S ho™ ,„ p lay a ,o, e ,„ 3 „ mulanng reslsmicc M „ Iubercij . ms mfecuon ^ ^ 

'~ - «W. B,„o,„ fed., ASM Press 

Washington, DC, 1994. 

According!, .here ,s a need in the an for imp roved vaccines and 

methods tor prevent^, treating and detecting tuberculosis Th, 

"'=• 'uoercuiosis I he present invention 

toU " iS ' hCSe ne=ds ™ d ^ P^Wdes other .-dated .tdva,,,^ 

SUMMARY OF THE INVENTION 

Br«lly stated. , hls mV5n , Km ^ ^ 

preve»„„ B md diagnosmg , u6 „ cu]osis ^ ^ ^ Do|vptpudts ^ ^ 

=o mD ns,„ s » :mm „„„ s e m c ^ 0I , s „ k]b|e „, Wni( . ju am]gcn ^ ^ 
.«» » M „,e„ *» d.fe onlv ,„ c „ ns „,„ vc subsmui „ TO w aodjfaiom 
e,„,„d, m en, of .hi, ^ lhe soiufcle ^ has ^ 0{ fc ^ 



sequences; 



A.D-Pro-V,i-As D -AlaAai.Mc-A S n-7h r .r! ir -C;,s-Asn-Tv r -(. ; i v . 
win-Vai-Vai-Aia-Aia-LL-i;: ,SFQ ID N (! . ;: 0) 

Aia-Vai-Oiu-.ser-r.iv-Vlet-Leu-Aia-Leu-uiy-Tlu-Hro-Ala-Pro. 

ier: <SI-.() !D \o. 121 ) 

'■-i-Aia-Me: ■. -,-\'to- .\, !; -Thr-Giv -a.;p-v ,iy-|> ro -i 
Au-i.vs-wiu-vih-An:; ^i.i.> :I) \ 0 

' ' ! U ' 1 ■ ;> -' '■• s - ; ' r °- 1 -uin-l' I( .-|'he-A.sp-Pro-.\ia- 1 riM iK- 
' SHQ !D No. :r., 

A.sp-Iie-(jiy.>s n .o!u-Ser rhr.Ciiu-Asp-r.in-Gin-Xaa-AlaAai: 

'S! 0 ID No. 



wo ^9 -t:ir<> 



"» Ah - p '-L>,.n^Tv r .xx i - Glu . <;lu . L<!u . Ly , Glv . rnr . Asp _ r . r 

GlyJSEQ ID No. 127) 

Thr-Ser-Leu-Leu-^n.Ser-Leu-AJa-^Pro-Asn-Val-Ser-Phe- 
AJa-Asn; (SEQ ID No. I2S) 

(j) X --^er-Glu-Lvs-Scr-Ala-Tl 1 r- I ,e-Lvs-Va,-Tl lr - Asp . A]a . 
Ser; (SEQ ID No. 134) 

Asp; (SEQ ID No. 135) or 

(,) Ala " Pr0 - , ' 1U - Ser - G!v - Aia ^--I---Glv-Glv-Tnr-Va!-Gin-AL 
Gly: (SEQ ID No. 136) 

whercmXaamaybcain ammo ac.d. 

™-o S e„, c ; ra i:; , :;r''; ohpcm,dc5 - — - 

to Mowing N- Kmll „a! scqueilres; " al "« 

lle-A Sn -V,i.|,, s . L , u . Vai .. SE0|[)Vu , 3Tiot 

• 0 -^P-! > nv t i.,-^in-Xaa-Asp,Vlct-ihr-L;..-<,iv-Tvr- 
Iyr-Pro-Gh-Gly-Aru-Arg-Xaa-Phe: I SEQ ID No pg, 
wncrem Xaa may be any ammo add. 

in mother embodiment -in- somri,- 1/ -„A 
., ni ,.„ . " ."/, t r™,,v, v annucn comprises an 

■Jni..o . !ua ,euucncc encoded bv i r>\ \ . „ 

<<- 'cauences recited :n in v 

complements of said .•icquene-s an i " ^ 

4 " JnJ J:vA ^'uences that hvhruii,e ;o a seuuen.- 

^c:tea :rl SEQ ID Nov ^ 4 ,„ ,, Hj " ^ Uinu 

,„„;.., m -i , ^' and 10! or a comniement thereof 
^crmoacrateivstnnL'en-con^r:,^ 

In a related :he -,„,. „...„, 
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am.no acid sequence encoded by a DNA sequence selected from rhe group consisting of 
the sequences recital in SEQ ID Nos.. 26-51. 138. 139. 163-183. 201 -40 -47.-47 
253-256. 295-298. 309. 316. 3.8-320, 322. 324. 328. 339. 333. 335. 337. 339 and 341." 
the complements of said sequences, and DNA sequences that hybridize to a sequence 
5 recked m SEQ ID Nos, 26-51. 138. 139. 163-183. 201, 240. 242-247. 253-256 -95- 
298. 309, 316, 318-320. 322. 324. 328. 329, 333. 335. 337. 339 and 341 o/a 
complement thereof under moderately stringent conditions. 

In related aspects. DNA sequences encoding the above polypentides. 
expression vectors comprising these DNA sequences and host ceils reformed or 
) steeled with such expression vectors arc also provided. 

In another aspect, the present mvent.on provides tusion nroiems 
comprising a first and a second inventive polypeptide or. alternatively, an inventive 
polypeptide and a known .V/. tuberculosis antigen. 

Within other aspects, the present invention glides pharmaceutical 
compositions that compn.se one or more of the above polypeptides, or a DNA molecule 
encoding such polypeptides, and a physiologically acceptable earner. The invention 
jiS ° Pr ° V,deS VaCC1DeS ^ om P™ng one or more of the polypeptides as described above 
and a non-specific :mmune response enhancer, together vv,th vaccines comprising one 
or more DNA sequences encoding sue, po.ypepodes and a non-specific immune 
response enhancer. 

In yet another aspect, methods ore provided for inducing protective 

:mmUn ' :V ' n ^ -™P™«* *Jrnm,«enn r :„ , pa ,,em an effective amount .„ , ilc 

■>r :norc )i ;ne arnvc poivpciMiues 

J " nL ' ' r :UL: 'I1 v ^:iiipii. inemoas ana Jiaenusiic arc 

prov^e, u, Electing turercu.os.s ,n , patient. The methods compn.se contacting 
dC ™' di " :1S °' a m '• V!,h " r ™e of the above po.ypeptuies and deteetinu an 
:mmune response on the nauent s sk:n The d,a,most,c kits comprise one or more of tne 



inov, :-:> !x T uue> :r. ..vrr.bmat.or, u,::, , t:: apparatus sufficient t, cont.v 
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In yer other aspects, method, arc provided for detecting tuberculosis in a 
patten, such methods comprising contacting derma, cells of a patten, with one or more 
polypeptides encoded by a DNA ^uence selected from the group consisting of SEQ 
IDNos, 3, 11. 12, 140. .4,. ,56-160, 189-193. 199,200.203,2,5-^5 ^37 ,39 , 61 

and ,42-347. the complements of said sequences, and DNA sequences that hvbndtze to 
a sequence recited in SEQ ID Nos, 3, ,.. ,2. ,40. ,4,, ,56-160. ,89-193 ,99 ^00 
203. 2.5-225. 237. 239. 261-276. 292. 293. 303-308. 3 , 0-3 ,5. 3 1 7. 321. 323. 325-3-' 
-0-332. 334. 336. 338. 340 and 342-347; and detecting an immune response on the 
' ?allem S Si ° n D,ayn ° St,C ktts such methods are also provided. 

Those and other aspects of the presein invention will become apparent 
upon reference to the following detailed description and attached drawing All 
references disclosed hercm are hereby incorporated by reference tn their entire* as if 

each was incorporated individually. 

BRiFF DESCRIPTION OF n* DRAWINGS AND SHOUFNCE IDENTIFIERS 

F.^e LA I) iita raKlhc Sllmuiat , 0n o, ^,,,^^,, ^ im „ cron 

: =roo ura „„ ,„ r c=„ s cd lrom , ^ a sctuM „ ,.„„„ r „„ mMmmum . donor ' • 

resnectivelv. bv the !4 t\< "n w i -.r,^ -v •■ ■ 

. . ,4 t . u . ana .6 u antigens desenbed in Example 1 . 

F.gure 2 illastrates the stimulation of proliferation and mterteron-v 

proauction :n I" cells derived -'rom 

.rom ji v/ ; «P,.r 1 -.v/ (W ;.v-,mmune mdivuiuai tn :ne IWO 

representative polypeptides IhRa, . uu | TbR a o 

Figure, : A .:> illustrate the reactivity o, annsera ru.seu ueamst secret 

l/ :Ut>CrCWOm Pr ° ,anS - ' he k ™ l/ »ti B e„ S5b an, the inventive 

anngens 1,38- , and ml,-, re W i>, w>th ,,. fttW , ( ,, 0 ,,,, ^ , } „ 

!Uoercuiox " s <-™tory pro,e:ns Mane H re ,, mhina „. p^., 4 . 

< r\ 1 ' '.am; and rccomnmaru X-b < mmc ~ .. 
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figures 5A and B ■11,^^ >u 

P . ° ells by ^ f^'on protein TbH9-7V,8-i 

■ o> J« .uiion Protein TbH9- IT).1S. i 

Figures 8a arid H ; lln«r-,, u 

^ ■ —„« p „, pepI , dcs xp , mn mm ^ ^ 

wr... " ' " DO,VpC °" des • XP " 1 ' «»* «>.f,o md 

SEQ. :DNo ; is the I) v a 

'iuie JJ,\a sequence of ITiRul 

■ --.i UicDNA sequence ofTbRalO 

SEQ. ID N'() 1 is the DM i 

mL UNA ^quence of TbRal i 

■SEO. ID No a r S Jhc nv , 

■SEO !D No. ^ !StneDN - , ., 

L 1JlVA •••«?uence pi TbRal ■ 
s EO. JD NO „ ,, teDV v 

ltU,A ^iuence of TbRal (, 
■SE(; ID NO - , s n\ i 

•» tncD-vA sequence ofTbRar 
S EQ. ID NO. 8 , s the 0\ \ 

' neUNA sequent of TbRal 8 
SEO ID NO g IS .J,,, nK . 

is ji t UNA sequence of TbRal o 

^En ID NO. :.), S rh c n\- \ ■ 
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SEQ. ID NO. 13 is the DN'A sequence of TbRa29. 
SEQ. ID NO. 14 is the DNa sequence of TbRa2A. 
SEQ. ID NO. 15 is the DNA sequence of TbRa3. 
SEQ. ID NO. 16 is the DNA sequence of TbRa32. 
SEQ. ID NO. 17 is the DN'A sequence of TbRa35. 
SEQ. ID NO. 18 is the DNA sequence of TbRa36. 
•SEQ. ID NO. 19 is the DNA sequence of TbRa4. 
SEQ. TD NO. 20 is the DN'A sequence of TbRa9. 
SEQ. ID NO. 21 is the DN'A sequence of TbRaB. 
SEQ - FU NO 22 * DNA sequence ofTT,Ra C . 
SEQ. ID NO. : 3 ,s the DNA sequence of TbRaD. 
SEQ. ID NO. 24 1S the DNa sequence of YYWCPG. 
SEQ. ID NO. 25 is the DNA sequence of AAMK. 
SEQ. ID NO. 26 is the DN'A sequence of TbL-"^ 
SEQ. ^N0.27 1;;:henNAsequenccorrbI _ 2 , 

SEQ. ID NO. 28 is the DN'A sequence of TbL-25. 

SEQ. ID NO -9 ,s -h t - dva 

-y is >ne L 'NA sequence of TbL-28. 

SEQ. ID NO. -0 is the HVa 

- u i-s .nc ,J\A sequence ot I M.-29. 

SEQ. ID NO ^ - s me 'w \ 

SEQ. ID NO , s .; le [)v . . 

■ - ii ..ie L jy.,\ sequence ot TbH-X. 

SEQ. ID NO i s thp n\i i 

LS me ,;N A sequence ot TbH- c >. 

SEQ. ID NO U -s 0\' 

• N u ' ->equence u , b.\l-; 

SEQ. ID NO Is :Jl ,. rr , . 

- >■-> wit Miuuence oi ■ h\\- ; 

SrQ ID NO :„ ., -s.- , 

■ ' • ' v\ .sequence o: :':.Al-o 

SEQ. ID NO. • > .», e :) v . . 

■ a sequence oi InM-"' 

SEQ. ID NO. -X is -he nv \ 

■s i.c u . A sequence :>r j h\l-0 

SEQ. ID NO :o ls -he Ds x . 

nt - u ' \ sequence o: IbM-P 



SEQ ID NO 4.1 :r ,.. - )N A 



^e.iuence ■: I b\;. 
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SEQ ID NO. 43 is the DNA sequence of TbH-4. 
SEQ. ID NO 44 is the DNA sequence of TbH-4-FWD. 
SEQ. ID NO. 45 is the DNA sequence of TbH-12. 
SEQ. ID NO. 46 is the DNA sequence of Tb38-1. 
SEQ. ID NO. 47 is the DNA sequer.ee of Tb38-4. 
SEQ ID NO. 48 is the DNA sequence of TbL-17. 
SEQ. ID NO. 49 is the DNA sequence of TbL-20. 
SEQ ID NO. 50 is the DNA sequence of TbL-21 . 
SEQ. ID NO. 51 i s the DNA sequence ofTbH-!6. 
SEQ. ID N( i. <: ij ^ DNa sequem;e 0 ,- DpEp 

SEQ. ID NO. ?3 is the deduced ammo acid sequence of DPEP 
SEQ. ID NO. 54 , s , he protein sequcRi:e of Dpv ^ 

SEQ. ID NO. 55 ,s the protein sequence ofAVQS N-termina. Antmen 
s SEQ ' ! D N ° 56 ' S thL ' Pr ° tei " s ^e»ce of AAMK N-temunai Aniieen. 
SEQ. ;D NO. 5- ,s the protein sequence ofYYWC N-tennmal Antmen 

SE< ) IT) NO ^ iv: rM > 

- ->■< is tne protein sequence of DIGS N-terminal .Antigen 

SE( X ID NO 50 is the protem sequence of A EES N-iermmai Antmen 
", E0 - ID N ° "° :S ^ lcin s «^" rfDPEP N-termtnal Anu.e, 
^ f N<) ' M )S lhe Dr0tan of APK7 N-temunai Antiuen. ' ' 

" LQ - ID VJ ° 3 ,s ,hc ded ^ed ammo ac,d sequence of PbRal 

■ ~Ju.ui .mum, Jaa scuucnc „ ,„'-;- hRal() 

- -1. ULU .uri-n.o ; , C](J , cullL . nCL , .,;--;> Rai • 

• me Jeuucea ammo acid sequence of FbRji:. 
lD ^ " 7 1S ,he Jt " uc ^ » ac,d sequence of TbRal 3 
■V Q II. > .-,< ) 08 ;.s the deduced ammo oc.d scuuence of TbRal 6 

SFQ ID N(> -,0 ; S ; : 

e ^'^..d -inmo acid sequence .ifTbRai - 

VO.) !I> \( . -o ... ,. . 

-u.uu-.. ammoacidsecim-iu,. >f7-Ra:N 
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S^. ID NO 73 is the deduced an.no acid sequence „f TbRa26 
SEQ. ID NO. 74 1S the deduced ammo acid sequence of TbRa28 
SEQ. ID NO. 75 is the deduced ammo acd sequence of TbRa29 
SEQ. ID NO. 76 ,s the deduced amino acid sequence of TbRa2A 
SEQ. ID NO. 77 is the deduced ani.no acd sequence of TbRa3 
SEQ. ID NO. 78 ,s the deduced am.no acd sequence of TbRa3"> 
SEQ. ID NO. 79 ,s the deduced axn.no acid sequence of TbRa3< 
SEQ. ID NO. SO is the deduced am.no acd sequence of TbRa36 
SEQ. ID NO. ft, is the deduced mmQ ^ sequgnce orn>Ra4 

seq. id no. , : is the dcducKj ammo aad scaumc5; u . ^ 

SEQ. ID NO. S3 is the deduced am.no acd seuuence of TbRaB 
ShQ. ID NO. 84 is the deduced amino acd sequence ofTbRaC 
SEQ. ID NO. 35 is the deduced am.no acd sequence of TbRaD 
^Q ID NO. 86 is the deduced am.no acd sequence of YYWCPG 
SEQ. ID NO. 87 is the d ^ d „ ^ of TbAAMK 

SEO. ID NO. 88 is the deduced um.no acd seauence of 7*38-1 

SI - Q ,D N ° 89 is the ^ ™o acd seauence of TbH-4 
SEQ. ID NO. on IS the deaucca amino ac;u seuucncc oiThH {{ 

SCO. ID NO. o, , thc dcduce . ^ . ^ 

seo id no. o : IS the i!educcd MIno ac; . scquence o , T , h _ i: 

SEQ. ID NO. g 3 is [he ^ ^ ^ 
ShO ID NO ^,s: h , aminoacid ^ u ^ i; . rh ^_. ^ ^ 

SF( ) ID \'( i .K,., h 

" ^ - • » ^ ^nmo ac:u sequence •.fTh3X : i'eptide 

' " V ••'« •' um.no aci sequence I^N-' p ; . nllde 4 

St( ) il) \( > >.)- ,«- ,u . 

'•' ' lm " u ' aaa sequence ,»fTMX- : Pfpt.de - 

* s JiL I; VA st\;u L -n CL . i): i;u) ^ s 
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SEQ. ID NO. 103 is the DNA sequence oi'ESAT-6. 
SEQ. ID NO. 104 is the deduced amino acid sequence of ESAT-6. 
SEQ. ID NO. 105 is the DNA sequence of TbH-8-2. 
SEQ. ID NO. 106 is the DNA sequence of TbH-9FI.. 
5 SEQ. ID NO. 107 is the deduced ammo acid sequence of TbH-9FL. 

SEQ. ID NO. 108 is the DNA sequence of TbH-9-1. 
SEQ. ID NO. 109 is the deduced amino acid sequence of TbH-9-I. 
SEQ. ID NO. 1 10 is the DNA sequence of TbH-9-4. 
SEQ. ID NO. 1 i l is the deduced ammo acid sequence of TbH-9-4. 
SEQ. ID NO. ! 12 is the DNA sequence oiTb38-lF2 IN. 
SEQ. ID NO. us is the DNA sequence oiTd38-2F2 RP 
SEQ. ID NO. 114 is the deduced amino ac:d sequence of Tb37-FL. 
SEQ. ID NO 115 is the deduced ammo acid sequence of Tb38-IN. 
SEQ. ID NO. 116 , s the DNA sequence of Tb38-IF3. 
! " SEQ - 10 N0 - 1,7 :s lhe Educed ammo acid sequence ofTb38-IF3. 

SEQ. ID NO. 1 1 8 :s the DNA sequence of Tb38-1F5. 
SEQ. ID NO. i 19 :s the DNA sequence ofTb38-lF6. 
SEU. ID NO. : 20 :s the deuueed N-temunai ammo acid sequence of DPV. 
' ' D N ° •" ! : ' :hc a( -' a "ced \-tcrminai amino acid sequence of A VGS. 
-» SEQ. ID NO. 122 :s the deauced N-temunai amino acid sequence ot AAMK. 

SEQ. ID NO. !23 is the deduced N-temunai amino acid sequence of YYWC. 
SFO ID NO. ; _a ; -he jeduccd N-ierrr.inai amino acid ..equence of DIGS 
SL(.). ,D NO ; , ;h L - deduced N-termmai ammo acid sequence of AEES. 

sho. ) \i ' - ^, , v . 

* ^"->--d .'-icrmmai mimo aciu sequence of DPE!' 

>r( '' ' Ll sl ' '"■ ::j :;lc ^uced N-ierminai ammo acid sequence ot APKT 

'' lD N< ' ! - S s Educed amino aciu sequence of DPAS 
SLU U " S ° * ^ P™' " sequence ,f DPPD N-rermmai Am, C en. 

— A ' ; i roit;:r i seuuence:. ,>f :our :)PP!) .A.inoue- 
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SEQ ID NO. 135 is the N-terminaJ protein sequence of AGO ant, C en 

SEQ ID NO. 136 is the N-termina] protem sequence of APE antigen. 

SEQ ID NO. 137 ,s the N-terrrunai protem sequence of XYI antigen. 

SEQ ID NO. 1 38 is the DNA sequence of TbH-29. 

SEQ ID NO. 139 is the DNA sequence of TbH-3C. 

SEQ ID NO. 140 is the DNA sequence of TbH-32. 

SEQ ID NO 141 is the DNA sequence of TbH-33. 

SEQ ID NO. 142 ,s the predicted ammo acid sequence of TbH-29. 

SEQ ID NO. 143 is the predicted amino acid sequence of TbH-30. 

SEQ ID NO. 144 is the predicted ammo ac:d sequence of TbH-32. 

SEQ ID NO. 145 is the predicted am.no acid sequence of TbH-^ 

SEQ 10 ,46 - ]51 ~ PCR « «* " the preparat,n of a fusion 

protem containing TbRa3. 38 kD and Tb38-1. 

SEQ ID NO: 152 is the DNA seuuene- nf >h f 

sequent ot L ne fusion protem containing ThRa3 

j 8 kD and Tb38-I. 

SEQ !D NO: m , s „ ^ ^ ^ ^ ^ 

^Raj. j8kDandroj8-I. 

SEQ ID NO- ! ^4 ; s nv \ . 

,SlheDNA ^^nccot:^.l/ /^rcwawj antigen ^8kD. 
SEO ID No iff IS lhe amino , sc . . 

■ ^ 01 Iht A/ twn-mosis antigen 38 

SEQ ID NO- ! -o is the nv \ . 

^ * " is tne U:\A sequence m XP14. 

SEQ ID NO. !> - is the f)\ \ - 

i- is int i^a sequence oi XP24 

SEO ID N(V i < (S , . r i-i ,» nv ^ 

1 oK JJlVA * .sequence :>i XP" ] 

SEO ID N( ) ; - o >s - hc nv , . 

SEQ ID NO: Jo, ,s the predicted ammo ac:u .euuenee ot XP14 

SEQ ID NO o2 ,s the .e„„i Jmino ac:J ^ ^ ^ ^ ^ 

-(implement m XF'! i 
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SEQIDNO: 1 65 is the 5' DNA sequence of XP4 

SEOIDNO: 166 is the 5' DNA sequence of YP5 

SEQ ID NO: 167 is the 5' DNA sequence of XPI7 
SEQ ID NO: 168 is the 5' DNA sequence of XP30 
SEQ ID NO: 169 is the 5' DNA sequence of XF> 
SEQ ID NO: 1 70 is the 3 ' DNA sequence of XP2 

SEQ ID NO: 171 IS the " DM v 

<i me „ UNA sequence of XP3 

SEQ ID NO: 1 72 is the v DM i 

5 me J D - N ^ sequence of XP3 

SEQ ID NO: ! 73 is the ^' DN \ 

-i^tne? UNA sequence of XP6 

SEQ ID NO r 4 is theV D Kr A 

^ - urs,A sequence ofXP6 

SEQIDNO: i o, the 5- DNA sequence ofXPlS 
ID NO: >76 I s t |,3-DNA.se q ue„ceofXP, 8 
SEQ ID NO: 1 77 is the 5" DNA sequence of \PIQ 
SEQ ID NO; - 8l stheV DNAsequenceofxpi9 

SEQIDNO: P9 is the " r>N \ - 

me _ ij!\A sequence ot XP22 

SEQIDNO: ISO istfaeVnVA 

- UNA sequence of XP2" 1 

SEQ ID NO ; lS1 ; h -. n 

mt. _ u\.-\ seuuencc ofXP^S 

SEQIDNO; :r„,| r v^, 

• "i- - o\ A sequence or" XP"K /. 

SEQ ID No- -si „,„ """ ■ 

SEQ ID NO- t ^---^nceor^XP,. 

■ 8> -s the predictcd aimno aad 

"Knolemem or'~bHJ-XP! reversc 

SEO !D \'i ' '.S(, 

- » -.rst :;redic:eu ammo .icu' sem ,• . ■ • . 

<}■■< < m> v, .. SLqilc '-'- t encixiea in- \'p;; t , 

seq id \( i :; s ;;; conjprcuia -- i -^---enco d e^ VX p., 

cogent of X P36." ' ~ amm ' > ^ * —e 

SEQIDNO : so ;, :iv nx v . 

Sr<J ;D N ' ' ^^sthc'lNv - 
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SEO ID NO: 195 is the DN'A sequence of RDIFl I. 
SEQ ID NO: 194 ,s the predicted ammo ac,d sequence of RDIF" 1 
SEQ ID NO: 195 IS the predicted amino acid sequence of RDIF5 
SEQ ID NO: ,96 is the predicted ammo acid sequence of RDIF8 
SEQ ID NO: 197 , s the predated ammo acid sequence of RDIF10 
SEO ID NO: 1 98 is the predicted am™ ac.d sequence of RDIF, 1 
SEQ ID NO: 199 ,s the 5" DNA sequence of RDIFP 
SEQ ID NO: 200 is the 3' DNA sequence of RDIFI2 
SEQ ID NO: 20i , s the DNA sequence of RDIF7. 

<- -0- .s me predicted amino acid sequence of RDIF"" 
SEVIDNO: 203 is the DNA .sequence of DIF2- 1 
SEO ID NO: 204 ,s the predicted amino acid sequence of DIF2-I 
SFQ ID NO; 20^ P ^ ppd 

are PCR pnmers used in the preparation of a fusion 
P™em contains F bR a, 3S k n. Tb38-I a, d DPEP (hercinafter refemd „ ^ 

SEQ ID NO: 213 ,s the DNA sequence of the fusion prote.n TbF- 

SET) ID \{ )• ~m j ;,- r u 

- ■ • -i4 ,s rhe ammo acid sequence of the u^on orcein Tbf^ 
SEQ ID \(); 2P is the n\M 

lb [nc - IJN A sequence of MO- I . 

" s ,nL - JNA sequence lor M()-: 

* ulL • L ^A sequence lor MQ4. 

SEQ IO \0: :i8 i s the " DM \ • 

*■> vne jna sequence tor MO-,X. 

SEQ IO \(); JMJ :s ~ v 

n - ^.V\ .sequence lor Ml u 

SEo ii; \ ( > , t 

■ AU -JV-\ seuuence :or MO-^ 

■ S K> 'P v. > ,. T 

^ ' ; - v ^ ■■t-'quence Mr MU-:s. 

3 . J,\A .sequence lor M( i-jo 
SEOfD\(v :: ; !s . h ■ 

tUL - i>^A sequence tor \tO- V) 

J ■ * ^ sequence fur \1< >- 

SEQrr»N«. ;s :Jv . !)K . , 
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SEQ ID NO: 228 iS I* pred[Cted amuK) ^ equence . r ^ 

SE0 ID N ° : ~ 9 ' S ^ P^ted arnino acid sequence for MO-8 
SEQ ID NO: 230 !S the predicted ammo aad sequence for MO-9 
SEQ ID NO: 23 1 IS the predicted ^ aad for ^ 

SEQ ID NO: 232 IS the predictcd Mino aad ^ ^ 

SEQ ID NO: 233 ,s the predated ammo acd sequence for MO-29. 

234 is the predicted amino acid sequence for MO-30. 
2-15 is the predicted ammo acid sequence for MO-M 
SEQ ID NO- 236 ,s the predicted ammo acd sequence for MO-35." 
- v >u NO: IS the determined DNA sequence for MO-10. 
SEQ ID NO: 238 ,s the prcdictea ammo acd sequence tor Y10-H, 
SE(,IDNO::39 Isth e3- DN : AscqucnccforM0 ^ 
SEQ ID NO: 240 , the fu]] , ength ^ ^ ^ 

Z in NO; 2^2 ' ** PrCd,C:eJ IUi '" ieng:h M,n ° ^ — cc «* DPPD 



SEQ ID NO 
SEQ ID NO 



is the determmeii ~n\; \ - 

SEQ ID NO: 243 is the determ.nc- " -n\^ 

irK - lDNa sequence tor LSER-I1 
-SEQ ID NO; 244 is th e determined ^DN \ 

-nm.j ., ^UNA sequence tor LSER-1 2 
Sfc.J ID NO: 245 i s the determined <' -nv v ■ 

•"uiLL _ .u\.A .sequence lor I SFR-l "> 
SFO ID NO- "a^ ' " 

- " " S the a " em »n^ J DNA sequence for i.SFR „ 
SEQIDNO:247 ;s[hed ,. 

" tu - -UNA sequence tor LSER-2^ 
SEQ ID NO: 248 is the ;-n,i:c:ec amino acid sequence tor I SER-10 

M ' ( '' 10 N ° 2: '° iS lhe " rcdic: ^ ac:u scuuence for i SI R ; . 

SFO ID NO- -s: ,.. .. ' " ' 

m v *" ' %mmv 1C:J tor :..SFR- lt , 

VQ ID NO- 2^ . s • k,. , 

'■ ■ - .i- reside.: amino acid sequence for I SFR ^ 
SF-O ID N( ) ^ ., »l i ' 

- - ^e determined cDNA sequence for L SFR-J S 
SEQIDNO::54 IslheJ(;I , ' 

UI ' L " -"■V\ scuuence tor I SFR — 
SFQ ID N( ) .... ; „ "" 

-'■ s • ».qut ru.e lor R. _i 
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SEQ 10 NO; ; 5S , s , hc picdKKi ^ md ^ 

seq id ,o : s " prcd,c,ed -*» ac,d f » <^ 

SEO lo ' ! " C ° NA f » 

SEQ ID NO: 264 j s the determined v .nw * 

sfo ro vn - - ■ scquencc for LSER " 5 

M,Q ID NO: : b , IS the determined v C DN ^ 

SEO ID Nn -> - - ■ ' SCqUCnCt -' f0r LSER "6 

Lg N(): >s the determined ,dna s 

~ " UNA sequence tor LSFR-J? 

^^W: 267 is the determined,-- - D Na 

SFO ID Nn >„■ ^ DN ^ sequence tor LSER-i 4 

L * N0; -"8 is the determined " -DN \ ~ 
SEQIDNO-, (iQisth . rf . ^DNA sequence tor LSER-J5 

SFO m kv " ned 5 ' CDNA ^ UM « LSFR-1 7 

NO: 270 is the determined cDNA 

SFQ,I)No- , • cDN ^quencc for LSER-1 9 

1L> N(J - ' ' is the determined ^ r-nrv \ 

SEQ ID NO- - ls , he d , ' NA SCqUaiCe f ° r LSER --° 

SEQ ID NO ^ 5 " ^ ^ ^ 

sty ID NO: .7., ls the determined s- -n\u 

SFO ID No— • , ^-DNA sequence tor LSER-2o 

v iU AU - - - is the derermin^ -nv , 
SFO,nv ; . . ^^^^-ncetorLSER^ 



lor LSER-^ 



g NO: — 'S the determmed " -ON \ ■ 

SEQ.DNo^. sfhei . ■ ^ sequence tor LSKR-2Q 

— ° ^ me determined nv \ - , 

- u; ; \a .sequence 

sEo,D.o^,; s ,r pred ' c ' ca - ,m,n ' ,ac!d ^""^" LsER -' 



- ^ninu acid sequence rur LSER.;a 



^ >nc preuicte 

«™t::rr- ----- 

- - •- .de pa-dicta: .innno lc -o . .■ , - r 
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SEQ ID NO 



SEQ ID No 
SEQ ID NO 
SEQ ID NO 



288 is the predicted anuno acid sequence for LSER-2t 



' acid sequence for the reverse 



289 is the predicted amino acid sequence for LSER-28 

290 is the predicted amino acid sequence for LSER-29 

291 is the predicted amino acid sequence for LSER-30 
SEQ ID NO: 292 is the determined cDNA sequence for LSER-9 

SEQ ID NO: 293 is the determined cDNA sequence for the reve.e comment 
of LSER-6 

SEQ ID NO: 294 is the predicted amino 
complement of LSER-6 
<s^n in ^n.- ■ 

~~ v - 7J ,b inc a «ermined f oDN'A seuuence for MO-P 

SEQ ID No: 296 ,s the determined 5' cDN'A sepuence for MO-P 
SEQ ID NO: 297 IS the determined 5' cDNA sequence for MO-19 
SEQ ID NO: 298 is the determined f cDNA sequence for MO-39 
SEQ ID NO: 299 is the predicted ammo acid sequence for MO- 1 2 
SEQ ID NO 300 is the predicted ammo acid sequence for MO- 1 3 
SEQ ID NO 301 is the predicted amino acid sequence for MO-19 
SEg ID NO- 302 is the predicted amino acid sequence for MO-3Q 
SEQ ID NO: ',03 :s the de, Crm ,ned - cDN'A sequence tor Hrdsn-i 
SEQ ID NO: 304 :s lhc ^ ^ ^ 

^ ID NO: 305 is the determine- <• -n\ v • r- 

^--t-rmine^ dJVA secjuence tor Erosn-4 

SK0 ID 306 15 the J — i 5" cDNA sequence tor Erdsn-S 

SEQ ID NO: 307 is the determined 5' cDNA sequence tor Frdsn-n 
™™*>- ;( ^— -:i)NA^u^io r! ,dsn-^ 

sro ?n sr. 



■ ^ icquence [or nruin-rt 



-eiermme 

^"■^ - ; ihc .Icilttp int*' * -tin \ . 

..j.uiLv* _ ,u.\A sequence lor trasn-^ 

^ID \n- :s me determined 
^OiD \0:3i:, SI hedetemimed 

in no- :s rhc . it?ltrrTnine:i 



-DNA sequence for Krdsn-ln 
v^HNA sequence for Erdsn-12 
^ } VA M-quence for Erdsn-i • 
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SEQ ID NO: J 16 is the determined 5" cDNA sequence for Erdsn-16 
SEQ ID NO: 3 1 7 is the determined 5 ' cDN A sequence for Erdsn- 1 7 
SEQ ID NO: 3 1 8 is the determined 5' cDNA sequence for Erdsn- 1 8 
SEQ ID NO: 3 19 is the determined 5' cDNA sequence for Erdsn-21 
SEQ ID NO: 320 is the determined 5' cDNA sequence forErdsn-22 
SEQ ID NO: 321 is the determined 5' cDNA sequence forErdsn-23 
SEQ ID NO: 322 is the determined 5* cDNA sequence for Erdsn-25 
SEQ ID NO: 323 is the determined 3' cDNA sequence for Erdsn- 1 
SEQ ID NO: 324 is the determined 3" cDNA sequence for Erdsn-2 

SEO ID NO K flu- .fcrprmln^ V „m- , - „ , 

" v " - "-^i"/! sequence lor trasn— <■ 

SEQ ID NO: 326 is the determined T cDNA sequence for Erdsn-5 
SEQ ID NO: 327 ,s the determined 3" cDN'A sequence lor Erdsn-7 
SEQ ID NO: 328 ,s the determined 3" cDNA sequence for Erdsn-8 
SEQ ID NO: 329 is the determined v cDNA sequence for Erdsn-9 
SEQ ID NO: 330 is the determined 3' cDNA sequence for Erdsn-10 
SEQ ID NO: 33 1 is the determined 3 ' cDN'A sequence for Erdsn- 12 
SEQ ID NO: 332 is the determined Y cDN'A sequence for Erdsn- 13 
SEQ ID NO: 333 ls the determined 3' cDNA sequence for Erdsn- 1 4 
SEQ ID NO: v >4 , s the determined 3" cDNA sequence for Erdsn-! 5 
SEQ ID NO: 335 is the determined V cDNA sequence for Erdsn- lb 
SEQ ID NO: 336 1S the determined 3" cDN'A sequence for Erdsn- 17 
SEQ ID NO: -37 is the determined 3" cDNA sequence lor Erdsn- 18 
SF0 ID N, V "* « ™ v ,i)NA sequence :„r Erdsn-21 

SE ° ,D N " : ;,) ::; ** "etermmed - eDNA .sequence tor r^an,:: 
SEQ ID NO u» , !hc Jetsrmmc , e!)N v., quet.ee forErdsn-23 
>EQ ID NO 34] ,.s the determined 3' cDNA sequence tor Erdsn-25 
SEQ ID NO 342 is the determined cDN'A sequence 'or Frdsn-24 



SEQ ID \( 



••c determined .imm. - .u id .equence tor a 1/ :uh,-rcuiosis 
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SEQ ID MO: 345 is a determmed ammo acid sequence for spot 2 
SEQ ID NO: 346 is a determined am.no ac,d sequence for spot ^ 
SEQ ID NO: 347 IS the determmed ammo acid seq for spot 4 
SEQ ID NO: 348 is the sequence of pnmer PDM-157 
SEQ ID NO: 349 is the sequence of pnmer PDM-16C 

SEQ ID NO: 350 is the DNA sequence of the fus wil prolem ^ 

SEQ ID NO: 351 i s the am.no ac,d sequence of fusion prote.n TT,F-6 

SEQ ID NO: 352 is the sequence of pnmer PDM-176 

SEQ ID NO: 353 ,s the sequence of pnmer PDM-I75 

B 102 UNA 3C <? ucn « °f«he fusion prolan TbF-8 
SEQ ID NO: 355 ,s the am.no acd sequence of the fus.on P rote ln TbF-8 



DETAILED DESCRIPTION OF THE INVENTION 

As note d above, the present invention is general dtrected to 
-positions and ,«hods for P revenun g , treaun g and d,a g nos,n g mb ercu,os, 
COmPOSm ° nS ° f *" "*« *** Poises that compnse a, ieast one 

^ ^ * ~ — « ■ ™ - such an _en that 
— only ,„ conserve suostnunons anCor Nation, Po.ypept.des ^ the 

7 pcor the prescm ~ :ndude - « - ,o. :_ mc < oluble 

; h 'T " ng ' n 15 — - - W — - cu, ture filtrate. ,, s used herem 
P ° 1VPePtldC " ™« ofanv icngt , , )ciudmc 

pL * rtlUC ?rtnas J'hux a P oivpcp„d, ,omnnMn, -n — .... „„.„; - 

- 'i"i.)uenic portion ot one ol the 

•"mue.is mas ^onsisi entire v ,u' . 
Jcl ,„„„ ai . - " ,L —■■»«* P™"-- « ™v c„„ a „, 

— o, m * ilt ,, otlgl , us . and s „ ch _ 

immunogenic " 
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The composite and methods of m 

v™ or thc ^ polypcpodcs Md DNA ^ a 

r ° - -~ - * „„, : 

~' — **" ™— ~ *. *, ,e OT ,c. mll0 J 

™ JOr " <* P»l Wide « retained P„ lvpcp „ de „ " 

preferably exhibit at least abon, W r ™l !P cp„d t lanants 

- , , , ^ lbm " ° no " : P"f<=n.blv a, least about 90% m d most 

preferably at least about «% ide„,„ v ,„ , h ■ , r u 

ui , h , "*• ldenIlficd P»l>P'P>.des. For polvoept.de, 

Property. llnams ma? . al,c™„veh, be- identified bv modiw „„ 

die ammo acid sequence of „„,. „• , k . ' oa,r > m e 

on. „, the aocvc p „ lv „ dcs 

"—"^ * ■ Nr use,, J £ 

~ °' a,aS "" S ' K b — — • — - be : ue„„„ed * „a,uu,m 

7„,ed ™ - — " — -i- tba, detect t„c ^ 

aosence ot tuberculosis s„c n -nod,,-,,,, 

modttKd >equcr,ces may be prepared and tested as.nu 
■or examole. tbe represents, ^ 

- . — tot „,„,„,,, „„„ im ,„,,,„ ^ 

' " -"™tv Aouid ^P<^ :hc secondary suture mu 

nyjruratnic nature or -n, li l l u r l lln u 

01 ' 10,>P ^ llUe » - —an.allv lin c„ a n«cd in ,„ n , 
■•nll-vmu .roups or an.mo ;lc:Js ^ r(JS ,-n, • " 
asn tl "•" rCStm - ,n '^™ve changes: ( I ) aia. Dro . ,|v ,Iu 
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5 sequence for ease of svnihesis nuHf «»J«g«cd to a Imker or other 

polypeptide mav be coniu^rf r« ^support. For example. a 

- conjugated to an immunoglobulin Fc region. 
A nucleotide "varinni" ,\- 

Ma ions. iucl1 mouiiications mav be . . 

°«„™ c allelic ' " " "a.uralh 

^e„ce spreI e rablv ^ lh „ a , ^ ? ™ *~ ~»~* 

» — ***** a, ;„ s , ab „ UI „„,, , ' PrCfOTbl> ' « "» *~ «* «- 

-leot.de seances gencraIJv ^ „ ^ ^ — Su = h — 
-"gent eondmons A , used her ' " ^ reCUe "^'^ *<**™ under 

• - ° SD ' S: ^^"u at ^ r oX ssc () _ " 

■<"Kmed bv two washes of ~.) ^m,ght: 

' ' ' ° m,nut « each :n !X SSC 0 "" SPK - - 

-° vvas ' le s of .10 minutes each in o.'X SSC ') snv ' ' ^ 

In J rL ' iatcd ^P"' combination ooivpemides , - ■ 

^mbmuuon noivr e n tlJ(; - , .-^pennac, Jr , ^closed. A 

Sln - Jt ,raM " ne 

ounces ,v h ,^ re ' ' """"""i .unertui,^ 

-omed bv of a , nk ' * * ,nterVC,,,n S ' im,n <> or mav b e 

-nmpene:;; poivneptides 
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*~ " ** *• m - an,«„- CS cba„ s e and re v„„ phase 

ctagn.p,,, Punikd an „ 8ms ;m ^ evajua(sd b ^ ^ 

appropna K lmmunc re5p , lnse (ef ^ ^ fof ^ ^ .^^^^ 

methods descbed h ere ,„. lmmulogcmc ^ may ^ ^ ^ 

3 O.I tC =ta qUK such *, M.t.onal Edman chemistry. & e 0dmM md B „ g £ „ j 

Biocham. 50:116-132, 1967. 

Immunogenic antigen, may also be produced recombinant us.ng a 
DNA sequence that encodes the antigen, which has been insened -nto an expression 
vector and expressed ,n an appropriate host. DNA molecules encoding soluble aniens 
■0 may be .soiatcd by screening an annronn.,,, v, lubercu[: ^, , v „ 



pn-55ion iibiurv with 



"** spec,M > **» «w* * ^ DNA 

-fences <„cod m? a„,i ? e„ s m;ly ot ^ nM bc ^ ^ k ^ ^ 
screerang an appropnate , uber „ los „ „ enomlc „ cQNA ^ ^ ^ 

on^ned fro,,, pa lien!5 llUic , cd W1[ ,, „ ^ ^ ^ ^ ^ 

penormed »n s ,cchn,a ues k „„„„ I0 of ordln ^ ^ ^ ^ ^ ^ 

dCSCnbed "' S ™ br ° 0k «*~ «"»"*' A Ltoravn, ^ Cold 

bpnng Harbor Laboraioncs. Cold Spn„ s Harbor. NY. 1989 

—mn, an :, p p r „ Dn! ,, c „ niher ^ m .DNA „r ,enom,c DNA liorarv :or UN, 

OT "" t ° *" '° «i«l,o„d, s ao m ,d ,,„ ra ^ ., mmi , JC ,„ 

„, lsolaIcd s „ iul>le M „ ?ens DceTOc ii|igonuci<;oi]de sMucncc5 iw ^ ^ 

" Kb 1 " JeS ' S " Cd " d «« ™ ™v „ cenorTncJ Js 

^nt™ „„ r ,,amp, c! „ s,„,„„„. , . : „„„ K , . J „„ ra((jr , w . 

— .>.,v,n.a, .a,„ r,a t „-R, mav als( , „ ^ ^ ^ 



z rr t ce,,s - - - — - - — « j 

d„ k; ^ screcns may ^ ba 

- o^„„„ r cells denved ftm ^ 

reacimry. as described above, anuwxly 

Re«ard,e S s orrt, ^ of prcpaialio „. ^ 
Pon,o„ s d,e,c wl descnbed bere.n , ^ may 0 r m „ „ „ ^ ^ ^ ^ ° 

0 induce an immuno^, n . 11 ■ to 

. . B v,urc ^"caiiy. the anticcns have the abiiitv -n 

-uce p roliferatl0n anchor cytokine prodncuon u , mleniron ^ 

w.l.. iniLneron-, and/or intericukm- ; ~> 
— , ,n T ce lls . K K „„, „ „ ||s md „ r mcrophaeesd _ |fr J m - 

- ,d,, dua , Thc of ^ for uw " 

"""T" r " POnSe " ' " * — reS ponse For 

^ — * - — P e pm „ 

cor, a ,„,„ g B cell, and-or maerooha.K „ , 

». h „ . . ' '"'Oiwi-fara-immujic individual , s onc 

«h<> is considered to be resisrmr ,„ ,1. i , 

„„ unKd T M ' *" J "*I"™" "f miosis by v,„„e of ha„„ a 

mountea an erfecriw v b 
• response ,„ „ ,„„„„„,,„„ 

disease T^-i, • " " ^ °' an >" ^ - -rnptorns ot n^uios.s 

aisLdie. i ceils. \R ce k u ^n. j 

L " ls and ™«-™nna C es denved from 1/ w<W 
:mmuns individuals may-.. , •""^.«<n.,- 

^-■■^™,„etn»dsi^„u 1 uo.h„scn,- lJrd]nan . sklil . I1 

-T.. ror example, a preparation 0I i> BMt . ... " 

_. 1V K ... KMC penpneraJ Mood mononuclear cdh . 

^ r«e employe,! without -.rh.-r ■ . 

generally be Drenared • , r ' ' ma > 

1 ^ :miirop Laboratories. W, T,,iu nt 

. lir ,. ;j , . -lis tor ase ,„ : „ c 3,.^., Jescnbcd nere!n 
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PBMCs from .U Wc^Wimmune individuals with mycobacterial proteins for a 
penod of 2-4 weeks. This allows expansion of only the mycobacterial protc.n-specitic 
T cells, resulting i„ a line composed solely of such cells. These cells may then be 
cloned and tested with individual proteins, ustng methods known to those of ordinary 
5 skill in the art to more accurately define individual T cell specificity. In general, 
antigens that test positive in assays for proliferate and/or cytokine production (u>[ 
interferon-y and/or interleukin-12 production, performed using T ceils. NK cells. B cells 
and/or macrophages derived from an M tubercuhsis-inuome individual are considered 
unmunogemc. Such assays may be performed, for example, using the representative 
10 procedures described Mmv immune,™, ^ ... ^ 

us.ng similar assays, and may be present within the polypeptides described herein. 

The ab.hty of a polypeptide (e.g.. an immunogenic antigen, or a portion 
or other variant thereof) to induce cell proliferation is evaluated by contacting the cells 
(e.g . T cells and/or NK cells) with the polypeptide and measuring the proliferation of 
5 the cells. In general, the amount of polypeptide that is sufficient for evaluation of about 
10' ceils ranges from about 10 n& -mL to aoout 100 ug , mL and preferably is about 10 u 
X*mL. The incubation ofpolypept.de with cells :s typicallv performed at 37°C for about 
S1X d3VS F ° il0W,n ^ ™ubauon ^ith poivpeptidc. the ceils are assayed for a 
proliferative response, which mav be evaluated hv methods known to those of ordinary- ' 
0 skill m the an. such as expos.n, ceils to a pulse of radiolabeled thym.dme and 
measuring the incorporation of iabei into cellular DNA. in general, a polypeptide that 
results in at least a three- fold increase :n ,roi iteration above background „.,. the 
proliferation onserved tor cells cnlrmeu .muiou, po^uue. t , consider to .me to 
induce proliferation. 

Hie abii.tv of a poiyrept.de :., M.muiate tne production of .nterteton- , 
• U,d,,r imL,rlCUkin - ;: m ™> »* -aluated * contacting fhe ceils w„n the 

poivpeptide ana measuring the level of interferon-, or in.erleuk.n- ! 2 produced by the 
h i:enerai - :he •"™ ,Un! ,i: roiyrrptr.ic that :, : u: r,c:ent tor fhe evaluation of about 
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bead or a biodegradable m ,„„ sphere . 3UC „ M ^ ^.^ ^ ^ 
No S 4.8,7.268 and i.075,09. The , n c u ba n „„ o[ ^ ^ ^ jf 

whach may be evaded by mrtods ta0M , 0 te Qf ^ nay ^ ^ ^ ^ • 
an ^nzyme-iiaced ta,,^^, ^ (EL]SA1 m m ^ ^ onL ]2 ^ 
Way such a, „ Msay pro „ fauon rf , ^ h ^ _ ^ 

^. »A. ,„ th e product „ f a, ,ea st 50 pg of ,„ K rfe ro „- r p „ ^ 
supernal, , contiunlng ,„,,„, j ^ pef ^ ^ ^ ^ 

nroducti 

' ' i^WK** «m <tMw> tto producuon of ji least 

macrophages or B ceils (or per 3 v l.y PRvrr, , 

P^r x 10 PBMC) 1S cons.dered able to stimulate the 

production of IL-12. 

In general, .mmunosen.c a»„ s e„s are rhose ar,,,^ rha, s „rr, ul a,e 
pr„.„era„„„ Mor ^ fa ^ ^ 

prodacon, ,„ f «,„. ^ cell , g cdk ^ ^ ^ ^ ^ ^ ^ 
■ o, 1/ irid , vlduals Amo „ g ttec |mmunogcmc m|i!cns 

p..-v Dt , lldes „av,„ 5 iupcnor ;he„« mt ^ mav he ^ „ " 

T " * ™ *• — — - » *e Percent- „, 

.he..pe U1 ,c P , op e„,e S ao , sumuhK ^..^ ^ ^ ^ ^ 
— - c=,„ de„,.d ,„ m mme , hln ,„„„ , Wi „ ;nd]vidua|s ri ^ 

" " ,m, " U " C ' ""^ - to - :„„ .„, tl f,» llv Jue ,„ 

,,K) -' L ' Ult, ^"' s :h;lt -«'"^- a response :r, a ;) ,<, h 

"'ercentaye ot f cei' \k" u 

. " ' Cl,,i ' u,a " r ■naerophage prcoarat.ons irom 

Xl "^''•^••vi«v/.v-imniiine miiivuiuals .with . in. , , 

iwitn a low mciuence ot responses :n cell 

'reparations irom other individuals) have sun,-™ ,u . 

ru\e superior therapeutic properties. 

Antigens With snni-rnr 'n,., 

.upenor .nctapeuHc propert.es may aiso iv .dent.tied 
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T " °" CXP "' m5mal •** « M bdow. EBicacv mav „ 

de,crm, n ed teed „„ ab ,„, ^ m „ gen M provide m abom a 

expend ,„,«„„, SuioWc SJpcnmm ^ ^ mdude ^ 

primates 



Anti saK havrng super , or diasnos[lt . pfopenies mv ^ 
■denied based 0„ *. a 0 „ 1Iy «„ dWt a ^ nse ,„ „ , nwdcnnd sfan pcrfoimcd 

**** "* aC " Ve »*~ *> * » » « P-*™- « , MtvjdwT 
«. « »o, Wta* * Sk ,„ , e „ s may ^ „ pmoTmed 

0 descnDcd below with n ™- t - , 

" — »» ai «:asi ; mm induration considered positive 

Immunogenic portions of the antigens described herein mav be prepared 

and identified usino well bnu-n , u • 

r , = ' such as those sumnl an,ed in p au| 

"°™ fo °- 3d * »~ PP. *W« and references =„ed 
.here,, Such ^ mdudc sc:Mmng ponionj ±q ^ 

<or , mmra „_ c e„,c properr.es. Tile rc ? rese„ 0 „„ p r o„ f era„c,„ M d cin *„e produc „ 0 „ 

OTbed hm '" ™ y • — * b = - — screens M mun , smc 

pon,„„ UI a Poivpepnde is a p„„,„ n ,h aL mthm sucl , repi „ emM]vc ^ 

- _ nap™, , r(f . proliia -a, lt ,„, lnler!eron .., prod „ cm , n ^ 

producon, „,a, . sahs.annallv s, mli ar „ ,ha, aencra.cn „ ,he : uil kTOIh Jra „. cr ;p " 

- " «»d, aa, ;mmu „ n ,e„,c p oni „„ „-„ ^ ma> „ „ ^ ^ ^ 

and preterablv about I 00°^ ,w 

™~l pr ni „er a „„ n a s sav desenhrd aerena An lmmun „„„ K „„,„,, ^ ^ , r 
~. ,«e ,,e .„ a, _ _ a„ U prc.era.v a,, 

' >'- ^..'.mene,,™,:; ,,„ ■ „ , n , 

— . .Hitmen ;n 'he 

mouei a.ss.-.y desenbed herein 

I'onons and other vananus of U ,,,,,,,,,, ;UUl , cns may be 
;\ " VnmCt ' C ° r rCC ° mb,nant S >^'- -iypepndes hav mu than about 



! 1 )() amitm jck 

ma\ -e generated 



inU ^ ncr:,ll > :'^er than .m,M:! , 1C: ,, 



a 
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technics, such as *e .V lmM Sl)lld . ptec synto ^ wh 

ir** addsd » - — - *. **** J r;i r 

196,. E quipmem for automated ^ poivpeot.H, 

— „ f „ m ^ s, h B APPliea Bl0SysMM , 

CA. «. „a y * opera , ed a£cordmg M , e _ fws 

-« ™.v ^ prepared ^ ^ 

» ^^dd^ s„e-s P „, lic Sec „ ons of fc 

R.-roraoinam polypeptide containing portions andor variants of , 

W - : """" ° f CCh — - — » - *.m„ * a 

— p.e. sapc™ from smmb|c hos „ vecior _ 

r tv^ ~* - * - — - . — 

: F °" ,>mnS • * ~« n-y be a PP „ed , 0 a su„a b , 

more reverse phase HPf r . - oni or 



fc a, m ; ^ — — t "°"» • *~ sk ,H ,„ 

" e ™'° VCa » <«™ ™~ ***** « ,h,s ,„™„ 0 „ 

iAprcssion mav be achirvpri in , ^"iun. 

— «* „ ™re S s,„„ ,, CIO , ,„ n , a| „, n¥ , DyA ; 

- ■ -o.,i ^:j.s iTTinkueii are ■ < r >,, 
ceil i.ne .such rs COS rM( , Mv , 

" ' U ' ■ ' A ; ^' ue:K " c ^ -M^.sseJ !n ih.s manner m 



i:; ^--rierai. n-ard'iev. .„ .(,.. [TV ,i„, f .. 
; " ' ri'-paration. trie polypeptides 
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" ab ° Ut 990 ° ^ ID C ™ ~ embodiments ^ rf . 
^ bd ° W - «" " ~ PO.vpep.des a, mcoiporated J^T*. " 

compositions or vaccines for use in „„ Pharmaceutical 
cmes lor use m one or more of the methods disclosed herein 

In certain specific embodiments the «,hWr 

3 Polypeptides composing at least an , m mVenQ ° n djSd ° SeS 

an H u ■ mg at ,ea5t 'Himunogenic ponion of a soluble A/, iuber-ul^ 

-tn 8 one of the fol.owm, N-tem™, s „ or . _ ^ 
d-ffers only In conservative substitutions and/or modificanons: 

<* A ^ Pro -^^A i p-Ala-VaJ-lle-Asn-T r ,. Thr . Cvs .. Asn . Tvr . Glv _ 
Gln-Val-Val-Ala-Ala-Leu; (SEQ ID No. 120) 

• - • — y.v.et-Leu-Ala-I.eu-Gis-mr-Pro-Ala-Pro- 
Ser; (SEQ ID No. 121) 

(O ^-Ala-Mct-Lys-P^Arg-Thr-GIy^^ 
Ala-Lys-Giu-Gly-Arg; fSEQ ID No. 122) 

^^--^-^y^ro-Giy-Gln-Pro-Phe-Asp-Pro-Ala-Trp-Giv- 
i'ro: fSEQ ID No. 123) 

<«> ^-"-G'y-Ser-Glu-Ser-^Glu-A.sp-Gln-Gln-Xaa-.Vla-V,, 
(SEOlDNo. 124) 

-ft •^-^-Giu-Ser-no.Se r -T lir -Xa, i -Glu-Xaa-,le-V ;i l-P r o; ,SEQ , D 

^o. 125) 

f'ro-Pro-Ser: (SEQ ID No. 126) 
^iv: \Si:o in No ]2~; 

^P-Pro-Aia-Scr-AI,-i'rn-.Un-Va ! -i'ro-r- nr . A ; J , ,. - . 



n 1 r-Vr.i^w.cu-Asn-S.r.i.,u-Ai. 1 -AM.-Pro-A S n-Va 1 -Se:-Ph e - 
Ala-Asn. (SEQ ID No. 12Xi 
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"> ^-^-Glu-Scr-GIv-AJa-Glv-Leu-Gls-Glv-T^Val-GIn-Ala- 
Gly; (SEQIDNo. 136) 

=nc^ g te mIlgt „ , denIlfled _ (g) above ^ ^ ^ ^ ^ ^ ^ - 

> MWd: e „, 0 ded b , SEQ ,D No 52 is provided ,„ s j7q ,„ No „ ^ 
-J— ^od,„ g „ nem defimd „ ( „ ^ , s prov . dEd io m ^ 
ded™«, « acld seqKnct is prov , ded ,„ S£Q |D ^ ^ a ^ ^ 
co^„d,„ s ,„ Mdsen (d) abovc , s prmidcd m m ^ ^ 

«~P«** .o M „ 6 e„ ,o is =ro>ldl . d , n S|;Q , d No : _, ^ , 

•» cn^poncns „, ^en ,„ is pr „v,d e d ln SE 0 m N„ no. .„ , 

^ uutcu oinmQ acia 

sequence is provided in SEQ ID No. 100. 

In a nanher specific embod.men,. t he sub.ect mvent.on discloses 
olvpepudes comprismg a: leas, an m™ g eni C portlon ofan , ^ 
H- g one ol th e foU^„ g N , tcrminal fences, or a variartt w** differs o , 
l > m conservat.ve substitutions and/or modif.cat.ons: 

X --^-"e-Ala-Tv r -Xaa-TTu-Th J -Ala-Gly-ne-Vai-Pro-G. 

[le-Asn-Vai-His-L,u-Val: (SEQ JD No I or 

_ XSn " PrO - t)r0 - A ^- p -- iil --"-Xaa-As D -M 

ivr-Pr(,-Giv-(.,iv-Arg-Arg-Xaa-Phe:(SEUlI)No -0) " 
0 -vnerem Xaa may be any am.no ac.ci. prcterab.y a cystcme res.duc. 

I" other specfic emboa.ments. the sub.ect .nvent.on discloses 
,-vpept.des com Dn ,n, at leasl an !mnil „ logcnic ^ ,„ 3 ^ , 

^■uuencL-s encoded hv . - n m.. r )v \ , 

■ *u. ->VA .cuuences. or 1C! „ NA ;Cl}liences subsUmnallv 

■■ l,m »'''!?"us: l . a sequence m M.onni 

I" *nher socc.rlc cmho,,,,,^, the . sub , e , 
poiypept.aes a.mpnsirm it le ts; , „ 
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Nos.: 26-51. 138, 139, 163-183, 189-193. 199. 200. 201. 203. 215-225. 239. 240 ^-T- 
247. 253-256. 261-276. 292. 293, 295-298 and 303-342. (b,the complements of such 
DNA sequences or (c) DNA sequences substantially homologous to a sequence in (a) or 
(b). 

In & a specific embodiments discussed above, the \f. tuberculosis 
antigens include variants that are encoded by DNA sequences which are substantially 
homologous to one or more of DNA sequences specifically recited herein. "Substantial 
homology." as used herein, refers to DNA sequences that are capable of hybridizing 
under moderately stringent conditions. Suitable moderately stringent conditions include 
n prewashing in a solution of 5X SSC. 0.5% SDS. 1 .0 rn.M EDTA (pH 8.0); hybridizinP 
at 30°C-65'C. 5X SSC. overnight or. in the case of cross-species homology at 45T. 
0.5X SSC; followed by washing mice at 65 3 C for 20 minutes with each of 2X. 0.5X 
and 0.2X SSC containing 0.1% SDS). Such hybridizing DNA sequences are also 
within the scope of this invention, as are nucleotide sequences that, due to code 
5 degeneracy, encode an immunogenic polypeptide that is encoded by a hybridizing DNA 
sequence. 

In a related aspect, die present invention provides fusion proteins 
comprising a first and a second inventive polypeptide or. alternatively, a polypeptide of 
:he present invention and a known .1/ tuberculosis antigen, such as the 38 kD antigen 
) described in Andersen and Hansen. Infect. Immun. i":2481 -2488. I"8'f iGenbank 
Accession No. M30046 , or ESAT-o , SEQ ID Nos. 1 03 and 1 04), together with variants 
of such fusion proteins. The rus.on proteins of the present invention mav also include a 
iinKer peptide netween the first ana second poivpentides. 

A DNA .ecuence encoding a rusion protein ,>f :he present invention , : , 
constructed using Known rccommnam DNA techniuues to assemble separate DNA 
;eauenccs encoding 'he first and second polypeptides into an appropriate expression 
vector The 3' end of a DNA sequence encoding the first pohpept.de is ligated. with or 

without a peptide ankc v - n I ■ r w - 

1 • ...i „[u Oi a ^ N. s. -.equence encoding tne second 

poivneptiae >o rh.it ^':e -eau'n" trirr>.-< ^' . . • 

• ' •-' ,u,,, ~ !rjn:i ■ e..ue:uv : ire :r, nna^c •.» :>»mi!! :i;RN \ 
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translation of the two dkm 

h , , e *"° DNAl W« mo a smgfc fusion pro , cm ^ 

h*** ac tmt> . „ f , )0 „ ta ^ , he ^ 

A peptide linkc Kquence may ^ , 

_ / lilt. IdCk Of nvdrnnhnhi,^ ~i , - , 

,- " ~ " " KCU rcs,aues ^ might react wul, the Doivpeptiue 

Junctional em to pes Pr-pt^r™^ ■ ,■ f u '>pepiiU(_ 

r~ r - im " ,u - - - - - - * * - i : 

v , - ^■^8-8.6.. L.S. Patent No. 4.935.233 and US Paten 

>«o.4., 3U80 Tne]m L-i. Paten 

„ • ■ m3> bL tr ° m 1 10 about 50 ^"ino acids m length 

- - - - - - — ***** * 

-..sentiai , -terminal amino add regions 'hit r,„ k 

. . "'° ni Lhat can be ^-d «' separate the functional . 

Jomams ana prevent steric :ntenerence. 

"* iigMed DNA — - - opernb, i inked t0 SUItabje 
r~ ^ ~ — — - — , ,em 

•" r '"' : •'- nr - !Ratl( > n "'^ ^ .>n,v ,re,cnt v to the "N 

^•und Poiv P ep, ld e " ' Cncodin " :h '-' 

or _ , , . '! ^ prL ' SCnt — P-des methods tor w one 

^ jb0Ve P °~ - — - or DNA mo,cu,e S encodtn. s.c, 

• • ' w ''' dL ' ' -° 'nca-e rroteetp.e TT-nrr:- 

^ aull: ' " "a^rcti.os.s : „ a patient. A, :is,-< 



;crs 



:nt 
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may be afflicted a disease. or m . v be free ofdelcctabIe di$case ^ ^ |n 
other words, protective immunity may be induced to prevent or treat tuberculosa. 

In this aspect, the polypeptide, fusion protein or DNA moiecuJe IS 
generally present wthun a pharmaceutical composition and/or a vacc,ne 
5 Pharmaceutical compositions may compnse one or more polypeptides, each of which 
may cent™ one 0I more of the above sequences (or variants thereof) and a 
Physiologicaiiv acceptable carrier. Vaccines mav compnse one or more of the above 
polypeptides and a non-specific immune response enhance, such as an adjuvant or a 
liposome unto which the polypepude is incorporated). Such pharmaceutical 
J compositions and vaccina m->i- .,i„, 

" ^ ,1 - '"^c/L«i^^ JiiLmens. either 

■neorporated into a combination polypepuae or resent within a separate polypeptide 

Alternatively, a vaccine mav contain DNA encodin, one or more 
polypeptides as described above, such that the polvpept.de is generated in suu In such 
vaccines, the DNA may be present w.thtn any of a variety of dehverv systems known to 
those ot ordinary ^ m the an., including nude, acid expression systems, bactcnal 
and y,ra, expression svstems. Appropriate nucleic acid expression svstems contain the 
necessary DNA sequences for expression in the patient (such as a suitable promoter and 
- i). dactenai Jell , er> . systems , nvolve ±e aUmmistratI0n o[ ^ _ 

nactenum .such as Baailus-<\ t ime>te.uucrr-„i ,n-u . 

"me.r L i, U Lrr.n, mat expresses an immunogenic portion 

■>« ^ Polypept.de on us cell surface. In a preferred embodiment, the DNA may be 
produced usm g a v.ral expression „tem ,. vaccm.a or other pox v.rus. retrovirus 
or adenovirus i. which mav invoke -he , t ' , i 

L Jlc ^ or a non-paihoeen.c (detective), replication 

^moetent v.rus. rechn.uues tor meorror-pp,- -w x . n , , ; 

■ )NA :ntl> '^n expression svstems arc 

wed known to those of ordinary >kii! n • . -w . 

•'' l)V " nm aiso -e -naked. " .,s 

described. lor example in ! 'im r ,., v 

■ • ■ . -i . ; \ ie wed 'n 
'° - P 1 ,le ^»kod DNA ma> be increased bv 

:; >at,ng which are etneenth transported into ihe 
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-vc. r„ r ^ of DNA cmodjng a ^ 

_ alner ,,,, or in , ^ ^ ^ dKcnM ^ * 



Rout* and fc,^ or ^ wd| ^ 

~r w * ,„„ K , 

10 suocutaneous). intranawllv,,. •„ 

. . . ' ' '■' — " °< oraiiy. Between 1 and 3 doses mav 

777 :or • ^ 3 *« - *„„, SKred M 

Alternate protocols „ ay U appr „ pruK for ^ ^ a ^ 

— orpo, yP e P „ de „ OK., iha , lvhc „ admms , rcd . a dKc . oed abow ^ 

7 """"" ,n ' C '" 0 " « - « — '» S™nL to M „ um „ r 

P« TO e„, ,» . oo, ,„ produccd ..„ „„ bv „ ^ fc , ^ 

™ Ptcc^v lrora aoou , 1WJ „ ,„ JD0 „, , ^ Su „ abli . josc £ 
, ™ y S "' UbiC ^ » ■h«o 1 -.« liMI , sk t„ ,„ „ mav 

:" y Je " ena,nc - ~ - — - p« mh 

- ^orr.c. ..^-rcraniv compnscs water. , a j-ne. aicohev , • •„ , 

^7. 7 0se - ™ ! — . * OT p,o«„ 

. : o,Ma., L •aiac-e- max al.s«» rx- ,m D lovcd xs 
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Any of a variety of adjuvants may be employed in the vaccines of this 
invennon to nonspecirically enhance the unmunc response. Most adjuvants contain a 
substance designed to protect the antigen from rapid catabolism. such as aluminum 
hydroxide or mineral oil. and a nonspecific stimulator of tmmune responses, such as 
5 hp.d A. Bonadella pertussis or Mycobacteria tuberculosis. Suitable adjuvants are 
commercially available as. for example. FrWs Incomplete Adjuvant and Freund's 
Complete Adjuvant (Difco Laboratory) and Merck Adjuvant 65 (Merck and 
Company. Inc.. Rahway, NJ). Other suitable adjuvants include alum, biodegradable 
microspheres, monophosphoryl lipid A and quil A. 

10 In another aspect, this invention n m ,.-«™ ~„.u„,j.. _ • 

- f."....^..,...,^ IW osihl; uiie or more 

of the polypeptides described above to diagnose tuberculosis using a skin test. As used 
herein, a "skin test" is anv assay performed directly on a patient >n which a delaved-tvpe 
hypersensitivity (DTH) reaction ,such as swelling, reddening or dermatitis) is measured 
following intradermal injection of one or more polypeptides as described above Such 
I, mjecuon may be achieved us.ng any suitable device sufficient to contact the 
poivpept.de or polypeptides with dermai cells of the pat.ent. such as a tuberculm 
synnge or i mL svnnge. Preferablv. the reaction ,s measured at least 48 hours after 
injection, more preferablv 48-72 hours. 

The DTH reaction is a ceii-mediated immune ,esDonse. which ,s <^ ater " 
m Pal,emS that ^ beCn ^ Piously to the test antigen , ; , . the .rnmuno.en, 
Portion or the polypeptide employed, or a variant thereof, Ine response mav be 
measured visuai.y. using a rule, in general, a response that ,s greater man about 0 < cm 
:n a.amete, preterabl, greater than anout : , ,,: :it diameter. , a nosune response 
• naiCat:VC :U ° erCUi0S!S M, ' n ^ - may no, be manuested ;U1 actlvc 

T,K ' p ° iypep,,deii 0, this ■•mention are preferablv formulated, tor „ se :n a 

>k: " !CSt " ? ' ,annaCCUUCal L ' om P^nions containing a polypeptide and a 

phvsu>ios;icalI\ accentaHl- o-.— „.r 10 ; ■ 

; ^-M. as jesenne-.: .uv.e Such composition,; tv P ,cailv 

-ontau; one or rnorv .i; • :lnoV! . ._,,..„,., 
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Preferably, U,e earner eroploycd , 0 sudl phannaccuoc , ^ ^ ^ 

sojuuon w,d, appropna.e preserves, such as phenol and/or Twee,, 80- 

In a preferred emboctaa,,. a p„lypep ti de employed ,„ a slur, IBI , s of 

penod. !„ e e„eral. a Mypep,* , s a , __ , ^ fa . 

™= polypep.de , s preferaWy brokcn ^ by ^ ^ f 

,n JK »o n ,o aj.ow preset ,o T-ceU, Sueh poiypep,,^ raay ^ 
or raore of rhe abovc xqm ^ ^ _ _ 

sequences. 5 - 



The rollow.n, Espies arc offered by way of i, luslraUon ^ no[ , y 
way ot limitation. 



EXAMPLES 



EXAMPr.F_[ 

Th,s example i, lusn _ tcs :he „ f tf ^ 

polypeptides from culture ^ltnr t > r\,i i 

ture .Itrate. Lnless othenv.se noted, all percentages ,n the 
following example arc weight per volume. 

" WtW ' WV in7R - VRC No. 25 177. or IPTRv VI (V 

No. ..-618) was cultured in jtenle <}.\S n - i-i - . , 

,T1 ^" 1 '" < :or fourteen davs. I'he -n__,, 

r^.r" "" cred ,tem " *• hu,k . , ; 

bottle and NaN\ was addt-d •„ ti, . 

LJ ,ht cuJture titrate ro a concentration of 0 04% r hc 
'HHtlesuere then placed lu , A ° L - coid r()um 



WO').r420- (> |iSW ll , 2()S 

bad be™ nraeJ W1[h etoo , mJ conmned a iMootDa 

12 L volume to approximately 50 ml. 

I"he culture filtrate was diaivzed inm n i«/ • , 

avatiable BCA assay (Pierce. Rockford, IL). 

The dialvzed culture filtrate was then lyophiHzed. and the polypeptide, 
suspended in distilled water The noIvWrf ,• ^peptides 

„ hi ffTi . A Pol.vpept.des were dtaJyzed a gaJns t 0.01 mM 1 3 

o HtnsihvdroxvmethvlVmethvlam.nnln^.^ „„ _ _ _ " 
, n , r „i •■ • . ' tois-ms propane buffer), the" 

-ng g rf profils , on ch P^mica 

4 6 mm , 100 mm ,P • * " ° M in,m column 

™ x ,00 * Prap ,„ B,„ SysKm , Franingta . MA j e , uillbracd in ^ 

Srad'cnnn ihe above buffer svstem Th i , M iNaC 

„,,,„ „ m ^ ™™ *<™ ™ ™«"ored a, a waveiengtf, 



0,8 i0 "™ n '*«"■ Vliif »-- -=00 A„ ?strom nore «. < wr0 „ 

panicie size (3.9 x i>f) mml n, " ^--micron 

anear r - ^ ^ ^ * e with a 

• InLaf ^ d,cnt !ro ™ "-<'0% dilution buffer ,0 .<><„ r F y in 

w,s ) -< i ^etomtnle). n,e flow rate 

ml/mmure ana -h- WP'r .,, 

" P - A - 1Ucnt w * ^""ored at :i 4nm . j.v actlons 

-'".aininu the ciuted :,o,vneptides v, ,1 < 

..utiuus .s,... .elected :o maxima the nurm . , 

maiviauaa samples. A D p r „xima,eis ^O 0 n. JPI ^ . oKn • 

- P-ni..a polypeptides were obtained 

c .„ .... ^ * »"* "» ™^ «* *c ™„v ,„ ,„ ducc r . 

^eil promeration :n PBMC m-n^, - ' 

.->.w\_ preparations i v PRvif. ^ 
. ' BN1C ^ from Jonors known to be PPD 

sK:n :cs; Positive .ma whose l-o-ik u . r , 

wijd^. >o!ljp!c proteins *- 1 — 
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Policies were added m dupHcate at concent of 0 5 to ,0 , 

davs of culture in 96-wcl] rorm i K Mg/mL AAer s « 

well round-bottom plates in a volume of m 

was removed from each »„.ii f „- ^ • of me <iium 

m each weli tor determination of IFN-y | evt .k a ^ 
The plates were then pulsed with . ^ Wow - 

Prions that resulted m „ ,. r . 8 ' gas K-nullaiiM counter, 

raited in proliferation in both replicates three few 

proliferation observed m cells ,,,,„„,. f ° ld grea,CT "»» die 

cells culru™, ,„ ^ ^ ^ comidm(j ^ 

'ELBA, PUSaI" " "*,„,. _ 

m ™o m temperature Tie „h,« , " non-fat dned milk for I hour 

, "" K ***ed s,.x „me s in PBS/0 ™- TUTrv , 

SMPI " ' 2 ,„ cu.ture medtum in ^ ~ " ""^^ 

ovemt^tatr.^^^ ™ 

— *™ di,,, , : , „ ;;;r;rr d - • - 

each we,!. r„e plates were the, „ A °™ ^ was ^ •» 

■'•our .neuron at room temncrnn.re -he . . ^ ^ 

uuie. k nc plates were washed mn T\<n 

* - "W* after :o m,„ w„„ , N - , fi WHn " '"^ 

» ^ „ m TO < 70 „, r ac,j om,ai *»* - 

- nm ds j reiercncc wavdemuh , 

" b0 ' h ™ P™* an 00 , w ,„, d , mK . , . * *» 

r i - rtJtcr than *e mean Of) from .< . 

udor.s. AereaMismereu positive. 



^ Ue " C:n "- :h < P»l>Tem«fcs were mdn-dual- dn- , 
H! m erAppi,ed B.oSvstems 0 lvn ■ ^ 
filters The niters w, th , , ' ^ CA ' ^ ^ rlhcr 

P-'vpeptide w„e loaded onto :1 Perkln Elm ,. 
B.o>y st ems D.v.sion Proe,se 4-p nr( , Elmer Applied 

-m n , emvn , . " Pr0tem ~ po, yp e ptidcS were S e quc „ced 
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Using the procedure desr nh f >H 

C. ^^-A S p-A, a .V^„e,^- Ita .T lr .X M . As , Ty , Glv . 

Gln-Val-Val-Ala-Aia-Leu^SEQIDNo 54, 
(b) ^^■OI U .Se,G,v-Me [ -L e ,A,a-L= u . G „- Tb .p ro . A , a . Pro . 
Son (SEQ ID No. 55) 

<-=> A1 **^^^P»^TI^^^ J>rai ^^ 

Ala-Ly^lu.Gh-Arg; (SEQ ID No. 56) 

«> ^•^c ys -P ro . Glv . Gln . Pro . Phc . Asp ^ oA| 

Pro; rSFQ in v 0> ^ 
(SEQ ID No. 58) 

<0 ^-^^'u.S e M,.S,.^x aa . Glu .. Xra , le . Va , p , o; ,„ 

No. 59) ^ ^ 

'8' AS "- P ™-«-Pro.A,a.p ro -P ro -V. n ,. Pro . Thr . A ,, A , 1 ,., h .^ 
Pro-Pro-Ala; |SEO ID No 60) and 

Giy; (SEQ ID No. 61) 
wheremXaamavbeanvammoacd 

An additional amigen was , oJaled _ 

PUn,,Ca0 ° n ^ * 10 * P-ciure aescnBed above 

tn.ct.on comprising a m ,xtu, of an„,„s - rom , he ~ h °' * 

P— , aescn^. ,, , mthd 0n „ ' he ; hr0ma,0graphiC 

-™^^^ ^ 

• x ' ^ih micron :k^j s;,- a) in. , ■ 

Pv *» '™ lT n,v, s ,„„ Moiief r~ H PU Pn, , 

were d„d from ,fc w„h , , " " S 

„0». TTV, ,„ " ' r ' mm "' 5 " ~ 

^jirn was monirorea at r ]rn T u . , . 



have a molecuiar weight of P 054 K i /k„ 

te^inai sequence: " ' ^ «» ^ N- 

<i' As P- P -^-Ser-AJa-P r o-Asp-Vai-Pn>-rh r -AJa,^a-Gl„-Gln- 

Thr-Ser-Leu-Lcu-Asn-Asn-Le.-.AJa-.Asp-Pro-A.p-VaJ-Sc.Phe. 
Ala-Asp (SEQ ID No. 6?) 

T> JS MM*. „ shown , 0 induce pro|iferadon ^ 

Performed using anion -xch-m • "" ,,aCllonal,on 

g an,on .xcnan,, cnrornatographv on a Poros 0 E colu mn 4 0 x 100 mm 

(Persepnve Biosystcmsj equilibrated in R,\ T ■ 

4 JUcl !n Blb " Fn s propane buffer pH s * p oivn ,. nr ; , 
uere eluted with a linear OhWn ^peptides 

rate ,f - ( . , ■ • ' " m 3b0Ve buffer ^ a, a flow 

- or , 0 n^,, co,um„ eiuent was monitored ar a w,ve,en E rh 

subbed * ^ iraCtl ° nS dUting rr ° m lbL ' ^ — - -.ed and 
(p ; " e ?naSC <~^> - g a Poros IC column 4.6 x ,00 ™ 

• Persepnve Biosystems). Poivpeoudes w , r ■ f - 

i. peptiULs were suited rrom the column with a linear 
graa,en, :rom 0-100% acetonunie ,0 TP U ,, n 

' ! llovv ™e o, > m l,m In . Thc d 
'■^a-s monitored at 2 ! 4 nm . * " 

Fractions coruainin" rh,. ..„„,., 

* - ,utul P»'ypeptiaes were :v«, D hiJizeu anr 

re-suspended , n 80 ai of aqueous <> - T 4 " 

aueou, .)., . J- A ana lurther subjected to reverse phase 

, ■ , , . ' 0mm " ^ ~ " ™ ' W — Anaivuca, remeeuia. 

a .mear gradient Y ... ; 

■ r ,, -^--i :^ v. o ;r.\j at a ;Iow ~uc or " 

,niR - f-uent was monuorec at::4 :m; " 

n>.e rraction with '•Mojom,.-,, n m , 

3 ; us , r! ,, r „ " AaS SL "™ a '"to one ;na,or Peak 

■ ' irhLr waller .'omnoiien'-, 'V,^,,.- ; 

re vaied thr- • ' °' ^ ^ ° nU> PVDF membrane 

ma, ° r nmds ^ — — ^ KJ. ZO Kd and Kd r,^ 

'" C •° ,u,w "- •-'^■r.a. iequence resnecuveh- 
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(k) 



Ala-Gly-A S p-Thr-Xaa-Ilc-T > T-Ile-VaJ-Glv-Asn-L t; u.TT 1 r-,VJa- 
Asp;(SEQ ID No. 135) and 

") AJa - Pro -Giu-Scr-Gly-Ala-GJy-Leu-Gly-GIy-Thr-Val-Gln,AJa- 
Gly; (SEQ ID No 1 36). wherein Xaa may be any am.no acid 
> Lsmg die assays desenbed above, these polypeptide, were she. t0 lriduce 
prohferauon and IPN- 7 prod uc„on in PBMC prepay. Figs. ,A and B show the 
results ot such assavs us,n g PBMC prorations from a first and a second donor 
respectively. 

DNA sequences that encode the antigens designated xs ,a,, (C ,. fd) and 

0 aDove w ere obtained bv scr^mn., . , , . 

= ~ --^wnv. iuucrcuiusis iiorarv usin" end 

com™*. .« ,„,„„„„.,„ codon bl=;i ^ scree „ perfomcd iis iK a 

V-QIDNo. ,0, T,,c ? ol }pcpl!ll e,„cod,dbvS ; :OIDN„. ,0, is provided in SEQ ID 
No 10. m pcn ; )mlcd using , pmbs comspondmE k ^ ^ 

,de„„„=d 3 a™ h,v,» g ihc icaucncc p[l)vldcu m scy n) No r ■ 

».n S . nr,*c co„,, nu „di n? ,„ a , ab(n , ^ , ^ ^ ^ 

provided m SEQ ID \o ^4 -.mi *h . 

' - M hL iCreen " erI0I ™ d with a probe corresponding ;o 

im,gen ,C " demif,ed 2 c,nne ^e sequence prov.dcd ,n SEQ ID No: 35 

' T " e ab ° Ve amU, ° 3Cld — —pared to kll0 wn om.no acid 

villains Mime ! ^.000 nrote-Ps - , 
' V!:R :ransia — Protesn .sequences ■ Version r- s 



'■" ^mr.iiican: r:omoji>L'!e- -nth- ::r:nr 



JC:vi scuucn ^ I'or antiucns in h, and ■ ! ! were actectc,! 

1 he ammo ac:d ^'LMit-nr.- Mr 

^uent. .or amiyen . n was lound to be homoiouous to 

J ..equenee irom A/ /^rac. The lull v, ( „h w ■■ 

UU lC ^ lh l/ ^ r ^' sequence was amplified trom 
_^ JNa M S:n! . :nc sequence -Stained :ro m < W XBANK 



• nis sequence w:^ vac: 
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TTc amino acd S c<, u „ re f or muge „ ^ f 

a sequence from A/ leprae. t0 

• !n the proliferation and IFN-v assavs descrihrH ^ 
. ■ «u>j>ayi> described above, usuiq three PPn 

positive donors the r^.ito ? h 1 u 

Xabk l: rePreSOTU,,Ve ™ ? ™ *>™ « ,„ 



table 




in T*. : . tha , „„, , ; s „ muiauon . 

jlL ^ id Umna ro ; , . 

andiowerSI.-and— br^h • 1 ^ one donor 

and i0r rhc *\vo :n rier dono-; n - v - ■ y 

- r^suirs indicate :ha ' " 



i:ucrrcron-v 



Production 



*nat ;ne.se antigen* .ire rip-im- y . 1M 

F ^ ' -™"c:ng Pronreration ana or 
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IlXAMPLEj 

Dessicated /i/ tuberculosis Hrm* mv r , 
* NNO soluaon. md ^ mmdv ™ t*™^ . addcd , c . 

^lemately homogenized and somcar^ • 

:o mil Tns pH - 5 »d h„ h l> " e raens ' v '-' lv ™*hed Huh 

o.s « P ,r:t - ro ° m — - - - 

" - donated via FPLC over a B "° 

'BioR^d). Fracuons were ^ ■ ■ Scale-Q-20 column 

P Torn .1/ .^errcu/w.vu-intecteci nanents which 

The most rcacnve !rjctl0n was 
PVDI- A Dand 3t appr0X)matdv S5 SDS -' AOt ed to 

85 kd was cut on, v.ddmi; the sequence- 

r"^'- 1 '^^— S ^ "> w^n Xaa 

'X* ,inv amino acui 

'^P^'notth,, nce 
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:03. ™ s ^ was fomd to c „ code , e _ md sequenM ided 

5 

EXAMPLEj 

This example illustrates rh^ „ 

» ™ sem bv S£ rr * dna ~ * 

Genomrc DN.A was isolated from the 1/ ,„a„ / 

DNA was nmdomiv sheared and , " ^ H37Ra " ™ e 

' ne3fed Jnd ^ d to construct an expression l.hr 
Lambda l\» e v nn .«:„ «Pression library using the 

XpreSS,0n s - vsteiI! 'Stratagcnc. La Jolla, CA) R lb h„ ■ 
generated against sector,. anti-sera was 

, ■ • pr ° teinS 0I ' ,hc Xl "J*™*™ strains H37IU IPTRv i 

hraman bv : mmunizins a rabb 1j/Kj - /R\ and . 

cultures. S„. S ™' * * " ™,„ ' 

protcm antisen in a ,o,al volume ot ^ m , KS °' 

: , r ■ --~w sa d IUVM , FmirweeksIatcr 

- » w „ _ " 

^ '"ii u\ CilUUM\ 'our lVf'"i-c ,, , r 

•™«n anugen. 7n, an U -,.. n . ^ ' JtLr W1, *> ^ 

c„ w ^ Hartw."vv' c : — 



t'-'-c/.v/rM/A dories .icciuciv: 
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s"z „ r :r ced * " ,tg - — * * - ,„ 

K - etaJ -^ £*p. .l/a* 1527-1 537 10QS p„„ 
m , , ' 1993 ' Rcprcsentauve sequences of DNA 

*«. ^ ^ ,„ _ ^ , dcnufied , n „ vcobacimum - - 

0 not m .1/ tuberculosis ThR A n \ * 

_ — - wiin a six residue 

TbR.V ThR , 1 N " S ' Sraf ' Can ' h0m °'°«- '0 

™ r ""°- ™* 13 - ™"' 7 ' «■* — ■ 

^ ' CW,,! !>BMC ™ D =n„ mtd „„ . 

:i r " C " U " vc — ™ _ r,*, prcpm „ on , ,„„„ „ 

PJiwm... art presented in Tables - md ; 

respectively " 
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In Tables 2 and 3. responses that gave a .stimulation index (SI) of 
between 1 .2 and 2 .compared to cells cultured in medium alone, were scored as - a SI 
of 2-4 was scored as as- SI of 4-8 or 2-4 at a concentrate of 1 ug or less was M 
- - and an SI of greater than 8 was scored as ^. In additl0n , me effect of 
5 concentration on proliferation and interferon^ production is show, for nvo of the above 

anngens tn the attached Fieuie Fr.r h nt u „ it 

^ ngUie - hor both P^ol'feration and tnterferon-y production 

1 bKaj was scored as *+ and TbRa9 as 

Ticse „,, ul ,s indicalc that tec so|ul)k n|[os ^ mdoce proutaioii 

0 individual. 



Tie 5cnom , c DNa iibnu, deenbed above, a^l m add,„„„„l H 37Rv 

bra °- ^ '™ ed «*• '<>* "f ob, a ,„cd from pa , raus wilh acovc 

tUbCra,lM ' S T ° ""»« *• "b«r. W H37Rv „ c „„ mic 

"NA was ,„, aKd . su b JC =,ed ,„ panial ^ ^ „, ^ „ ^ „ 

-press™ ,, braj ,. us,,,, lhc Lmbda espresaon svsKm iSmu ^ c ^ ^ ^ 

Three dirTerem ooois or'se*--* -irk . 

P ojs se_ ,ach contammu sera untamed from three mdiv.duals w,th 

active pulmonary or oieurai disease we- n, P( i ;„ ,i, 

-fc -v tlk . jscu ,n ttic expression screening The r>oois 

«~ d=s,p, att d TbL. mt M Tbll . ^ ,„ ^ ^ ^ ^ 

FN. ^ low rea c„v„v. TbM _ m ,d ium reacuvlIV and ^ _ ^ m bmh 

CliSA and immunobiot formal \ - 0U r'i n,, ,i 

un.i no,. i ,>i sera trom seven patients with active 

-.'Uimonary tuberculosis was also emnioved \ U •>.' -i- , .l I 

■ -vn •>! .i.c sera jacked mcreaseu reaetivn 

with rne recombinant }X kl) M > ( .r .,, llVM , . . 

;<_a pnospnaie-niriaint: protein. 

Ali pool, ^.j,^ „„,, , |vMk . Jnd asa| 

7Ra Md H37Rv MprKs '™ - m s™ * „ ai .. 



nun tied 
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Ttey rwo d„„« „ OT pttn(!ed . 0t ^ . , represemed 

or ,he DNA motafe .dennfa, m provlded , n SEQ Iu Nos ; ^ ^ 
■tec. TW-8-2 (SEQ. ,D NO ,05) « . ^ clone of ^ ^ ^ q 
> NO. 43, wm™, (Seq . , d nq 44) „ non M ^ goous ^ ta 

A^o add se^cc, for ta » !K heraMfter ldcntifjed _ 

™-M.™.,™-,» d ™, :ireshowninSr;QIDNos : n ; 

S e que „ CC5 ^ seqnencK ,„ fc ^ ^ ^ ^ 
above reV5ale d „„ Slenillcam homoIogies [o ^ ^ H _ s ^ ^ 

0 although weak- hnmnlnm^ . T . _ 

- ..... 1UUJ1U (U , Dn . w , oH _ u was rbund io , e homoJoyoui 

- a j4k D anugemc ?rotem previousiy ldentified m ^ paratuhercuiosn Kcc 

reading frame for the antieen FSat * , •, 

v r . g " ESAT - 6 P rcv '°u.sly identified in .1/ bovn (Acc 

No. L.>4848) and in \L tuberculosis (Sore'isen »i il w„ , 

, 99 .. u->ore.iien a al.. / wmi/ „ c»J:I710-I71 7. 

Probes denved from Tb38-1 and FbH-Q. both isolated from an FP7Ra 

^?;;: e r ed to M ciones m - «™ ™-» ^ * 

ro jS - 1 I-_.Th38- i F3.Tb3 8 -lP^ d ^8-IF6,SEO.IDMOS.I,:.U3.: 16 118and 



!F2. 



' Sh0 !D N '° S - 112 ' lnd 115 310 no^onnguous sequences from clone tv,8- 
' ' W ° readUly ' rame - S WrC * in Tb38-rF2: one corresoonos to Tb'TFL 

' • C V- ™ H4). the second. , parual sequence, may he the homologue of fb^S I 
and i > cailed Tb38-ii\' i SKO ID \'() its. n, . -i j , 

^^'duced amino acid sequence of:t;X- 

• i" • presented ;n SE'i) '0 \'0 

. ' " * Dn ' TC :"cmuleJ three Jone.s :n -ne 

■ nnrar. rbH-^-Fi SH • T> W, •,„ ■ ■ 

■ - (KS '-^ SEQ. IDNU. ]!.»,. ail of wh.ch 

-ire nitznly related seuuences 'o I hH o r; i ■ 

■ o lt>H-.> Ine deauced ammo acui seauences for -hesc 
tnrce clones are presented m SFQ ID NOS. !<^. and , , , 

f-unher scre-nir-.L; of the 
.iescriheci ,ihov t . . 



•■•'/>—: :// (M; .v genomic D\-\ iibrar. . is 
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above, one was determined to be tended to the 14Ki ,i„h 

« ♦ ■ • ^ I4Kd ^Pha crvsta] in hear shn^u 

protein previously shown to be present m U tuh ; ■ ' k 

^csent in A/, tuberculosis, and a third u^c j r 

s TbH-jj) are provided in SEQ ID NO- 118 141 • , 

gene bank as described alcove N n h . - ^ 
, wfl h ° hornoio ^ s — found to the 5' end of TbH-*>. 

whrch coma*, ^ rcactIve Qpen rcad]ng ™ 

'0 found to be identic! ,n ^ . . IDH-.9 was 

M „„ ci/iwu cosmid TZZ7. TbH-^ and ThW - 

■ b " » *e Pre v, ouslv , dc „ Ilfied „ . ; ■* ~ — 

refilin, a t ' " e0 ■"■ insertion element 

IS61 10 Md to the .V/ ,„,,„„,,„„. cosmid 

•o TbHOO wee f„ md . * ' " ' N " ^"^cs 

" pit r - : bed « ^ ***** 

Sera mcuoauons were rerionied -or - w ' 

aetected bv « < ' ■ ~ * B » u - *".bodv was 

detected ny addition of 'I-Iabe^d iw* v 

vanable Mm SUbSCqUem t0 for 

ana0ie nme<; ™S>^ from it, hours ,o - i j a v r,,,. , ■ ■ 

.ummamed ;n Table 4 ' ' the ™ obio, ^, 
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TABLF 4 



^ULSn Sera 

TbH-29 45 Kd 

"^H- 30 No reactivity 

TbH-32 12 Kd 

^ H -3 16Kd' V ' 



Human M. tb Anu-IacZ 



Sera 

45 Kd 
29 Kd 
12 Kd 

16 Kd. 



"""" " ~ " " — * ^ Antigen* 

immunoblot ts not sufficient. 

rCSUhS ° f T " Cd! ^ I**™* on Tb58-;. FSAT-o ana oth. 

tl , reco:nbl _ Mllgcns „ ?resemed :n labjes 3 _ o _ ctivei ; 
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TABLE 5B 

Results of PBMC Interferon-v Production to Representative A^mn^ 



Anngcn 






Donor 




I 


2 


3 | 4 


5 


6 


7 


1 * 


9 


10 


1! 


Tb38.1 






















ESAT-6 












- 










TbH-9 























T ABLE 6 

Si -V1MARY OF T-CELL RESPONSES TO REPRESENT \TTVF ANTIfiFS'S 



Antigen 


Proliferation 


interferon -y 


total 


patient 4 


patient 5 j patient 6 


patient 4 patient 5 patient 6 


TbH9 






. , i ^ 

1 


-t-+ 


13 


TbM7 




I 




1 

i 


4 


TbH5 




_^ i 

1 






8 


TbL23 






. t , i 
1 


7.5 


TbH4 








- connol 






0 



These results indicate mat both the inventive M. tuoercuinsis ami sens 
and ESAT-6 can induce proliferation mayor intcrferon-y producuon in T-cells denved 
rrom an M tuberculosis-immune individual. To the best of the inventors knowledge. 
ESAT-6 has not been previously shown to stimulate numan immune responses 

A set or six overlapping peptides covering the ammo acid sequence of 
the antigen ib38-i wis constructed using :iie method uesenbed m Example n. "I he 
sequences of mese peptides, hereinaner rererred to as pepi-o. are provided m SEQ ID 
Nos 93-98. respectively. The results of T-ceil assays using mese pepudes are snown m 
Tables " and 8. These results confirm the exi.sier.ee. and help to localize T-cell epitopes 
wimi: - rnjS -' enable .if mciucmi; prolih-nnion a;:c interferon-- production m I-ceiis 
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Studies were undertaken to determine whether the antigens TbH-9 and Tb38-I 
represent cellular proteins or are secreted tnto to. tuberculosa culture media. In the first 
study, rabbit sera were raised agarnst A, secretory proteins of M Hercules.. B) the know* 
secretory recombinant A/, tuberculosa ^ n 85b> C) recombinant ^ ^ ^ 
recombinant TbH-9, us.ng protocols substantially the same as that as desenbed in Example 
3A. fotal to tuberous lysate. concentrated supernatant of to tuberculosa cultures and 
the recombinant antigens 85b. TbH-9 and Tb38-1 were resolved on denature C els 
.mmob.lized on nitrocellulose membranes and duplicate blots were probed -using the rabbit 
sera desenbed above. 

The remits OMhlS anah'.Sis lisinu mnrrnl r, . j 

w auu auubcra (panel ii) 

against secretory proteins, recombinant 85b. recombinant 11,38-1 and recombinant TbH- ( > are 
shown in Figures 3A-D. respectively, wherein the lane designations are as follows 
molecular weight prote.n standards: 2) 5 ug of to. tubercular lysate; 3) 5 ug secretorv 
proteins: 4) 50 ng recombinant Tb38-i; 5, 50 ng recombinant TbH-9; and % «, ng 
recombinant 85b. The recombinant antigens were engineered with six terminal h.stid.ne 
residues and would therefore be expected to ungrate with a mobility approximately 1 kD 
larger :hat the native prote.n. in Figure 3D. recombinant FbH-Q is lacking aoprox.mate.v 10 

' D >r ' hC fU,MCngtn J: kD *e significant difference in the s.ze of the 

:mrnunoreacnve name Tbf-P) mnn^n ,« tu i - ■ ' 

° M Jnll " Ln in thc ivsate Ja ne (indicated by an arrow). These 

results demonstrate that Tb^8-I md Thw o , r » . m ■ 

in.* I and ibri-^ are intracellular antigens and are not actively 

secreted by M. tuberculosis. 

The fmdine that rb„_q is an intracellular antigen was confirmed ^ 
l!e:C:71!inmi: rCaCt,VltV n,M - 1 — —an 1 ceil clones :o recomoman, 1 bi.-O 
CCCre: " rV " •'" / ' m ' ; "" V;A ■• m,e!ns " inJ !J! ' !) * ^H-O-specnlc I' .ell cone , designated 
^ ; V " S gCnera,ed :r0m " BMr '^nealthv PPD-pos„ : ve .ion,, T:,e prontcatiy, 

r " SDOn ^ :31ThH -" tl ' Breton proteins, recombinant TbH-9 and a control V/ 

tuhercu'n)sis anti^ei 1 TbRi'i u-,c 4 ,» j- 

fc " ie:crmincd ™ ^neasunnu uptake or mtiated thymidine, as 

described ;:: Example ! A: >hown ;i F^ur- u.\ -I 1P .. - w ^ i 

1 - ur - i. tiiL d.<ne : : i n! <^ r^ponUs spec:!icalh 
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; d ; ,sna,ed PPU soo ' ,o) — d ^ pbmc *» • ™,os,„„ donor 

Tta. resul. confirm ^ b ^ ^ ^ ^ 



c. 



Genomic DNA was isolated trom M tuberculosis Erdmaji smiin. randomlv 

alkaline phosphatase. Im 

— ox;:::i;:;:r:: r;:; ^--» — - 

SE „ iD l: " ra,VC!V n — ™- -* — - X,,, , s pr „,, ded ,„ 

sequenc proved in SEO ID V... : CO ' 
Chanson «„ 5nccs , or Ille „ d0 „ es , heremai . cr 

Ai_ inc \;>-„ , r , shewn :n SiiQ ID \cs. ^ - f , a 

rc ip ^: : %e :> . w,th the 5' sciences ;i,r \p.; v„ r , VT) . n . 

v . - . ' ' ' Ullj XI 30 heln ^ shown in Sf« > ID 

W io>ioX. resDccnveiv. md the ,n,< - ■ ' 

• " **■ ' sequences :or XP2. XPV XP6. XP18. XPN 



and T4. 
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open reading frame encoding the am, no ^ 

it me amino acid sequence shown m SEQ ID NV 184 Th, 
reverse complement of TbH4 VP? r j 

m . ./ X?l '° md *» °P™ '^.»S fame encode 

-no a.d s e qrac e shoiM ,„ SEQ ,D N„ I8i ^. DNA se qu e„ce for W36 « fomd „ 

add ., , c reverse con , pkmcm cona|nms m readmg ^ ^ 
amino acid sequence shown in SEQ ID No.. 1 88. 

Recent, X P, ^ was prcparcd B descnM 3tove 

F '^ S SA ' B - ** - he™. recomblnM w 

was rouna to stimulate ceil nmiiw.r,^ , ^ 

' ' ^ ^ * ^ P™uctiun m I cells isolated from an 1/ ' 
ntnercuiosis-unmwc donors. 



**" PreSi ' 0n »"*>• "'"•'"y'S to Lambda Scree,, excess,™ 
system cNovagcr,. Madison. W!) as dc«-«h,.' kj . - 

. ron , . "' KL " Pw*d «» ob,a,„eu 

~~ -<i *. ™ *™ ,„ ^ „, h ,, ,„„,,„,„„ 

C °" ^ ,K — 3>L ,. Tb58 . . . ^ T! , HJ DpF „ mo 

..Ran. ^ ,„ M :te „ Dression jib[3ry ^ jescnbcd ^ _ ^ 

•he sec„„dan. anribodv ocmg , M ' 
, m , sphausc * eonmealed iv,,h .ilkaijne 

i wc:k\ -seen dono we— -i^r"> . . 

- -tit.. . ..mpanson .»i :i.e ietermineu cD\ •'» 

^ meSC a ° n " rCVCai ' J n< > — ' nomoiouies ,o .„ ()t the clones (heP ., a „. 
^rreJ :o as LSKR-.v. LSI* !: ! Si'R-' " - srP ^ , , ri , . ' ^' 

.^tK-_4. and I SFR-'^- r; 

t ' ' ^ ^termmed - cDNA sequences for LSI R-iO 

^ . ..M K - / provider :n Si-Tj Jf) NO- 

4 . resnectiveiv wni- >rT ... 



■imitu^ :ic:d ,r;;n 
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and LSER-27 are shown in SEO id vn ic/ 

. ■ , ^ D N ° -' J - 25 °- ^pcctively. with the correspond,™ 

predicted amino ac>d sequences bang provided in SL-Q ID NO ^7 , 60 r, • 

* ^ dettmmcJ 5 . cDNA sequencEs for smera of 

e ;: : 10 35 LSER "'- LSER0 - lser - 5 - lser -<' «>*-.« 
l Lt ; LSER ""- LSER - :o - lsero2 - tsER - ^ - 

r; t:;;:' r for lser " lser - 3 - lser - 5 - lser - 6 - lser - 8 - — 

CR b ER ""' LS£R - : °- LSER ' :: - LSER - «»* «, 

" -espeenvdv The daermmed full-i OTB1 |, 

" iCl,Uen " '° r LSER - » P~»*l in SEO ID NO: 2* The reverS c 

complement of LSKR-6 (SEO ID \0 ->o-i ^ r > 

k NO. _ ),) uas found to encode the predicted am.no acid 
sequence of SFQ ID NO: 294. 



■"lie or no immunoreacnv.tv with otner .rumens ,>r -h 

. ' " mi ^ ns " r ^ Present invention. Ranb.t an„- S era 

•vaj, generated auamst the most reactive 

n aUnc I ' aU,0n the ^hod desenbed in Example U 

i ann-sera was u^sed to sceen -m w , ■ 

" " U " nerC! ' l °" X «™n genomic DN.-\ expression 

-n.ar. prepared as desenned above Mace-ontn , , 

U - 1 -"° pna ' :t ' 11Juues "prcssm.; umnunoreaeuve 

" iM ^ ns vvere Pun.'ied ?hauem,d :ro r - cm- -v, 

. * : -uac, a.,. rescues an.: me nueieotide sequences 

■ w :uncrc:«asis Clones determined 



Ien jitJcren! done.s were nurpe i (V'-h. 

pur.r.ea ( >, diese. one was round to ^ TbRaJi 

' mj ™ ,,,unj w ^ * - - ^ 

■ '■ ^ h <-^'^ r.-terred to as KDIi ". RHF5. 



RHIFS. RDIFPi. rdik:: :<Dj! . . . k 



WO W -12076 |-( r IIS9'I032()8 

58 

corresponding predicted ammo and sequences being provided in SFQ ID No, 194 I98 
respectively. The 5" and y DNA sequences tor RDIF12 are provided in SEQ ID Nos 199 
and 200. respectively. No significant homologies were found to the antigen RDIF-7 The 
determined DNA and predrcted ammo add sequences for RDIF7 are provided in SEQ ID 
Nos, 201 and 202, respectively. One additional clone, referred to a, RDIF6 was isolated 

however, this was found to be identical to RDIF5. 

Recombinant RDIF6. RDIF8. RDIF10 and RDIFII were prepared as 

described above. As shown m Figures 8A-B and 9A-B. these antigens were found to 

stimulate cell proliferation and IFN-v production i„ T ceils isolated from M tuoerculos.s- 

immune donors. 

EXAMPLE 4 

^^^^ 

Prdtfin Dhrivativf 

An .1/ tuberculosis polypeptide was .solated from tuberculin purified protein 
denvative iPPD) as follows. 

PPD was prepared as published with some modification (Seibert I et al 
Tuberculin punned prote.n derivauvc Preparauon and ana.vses of a large quamitv for 
stanaard. Th» Ame rican Review of M.h^,,,;^, ,.,.„ , c 

M. tubercuiom Rv strain was grown ,or „ weeks ■„ synthetic medium , n roller 
sorties at 37-C. Rottles eonuurung the bacterial growth were then heated to 100" C ,n water 
vapor tor 3 hoUR , Cultures were sterile filtered using a 0.12 u , iltcr ^ the „ auid phasc was 

concentrated 20 times usmtz > in ■,.t ,tv 

- ^«nu .. - kD .ui-orr memoram- Proteins were precipitated one, wan 

11)15 ammonium sulfate soiution and e-»ht -r , ,rh 

; - n ' "'"^ A,th - " ammonium sulfate solution The 
™l.m t ^ ,,,,,, «« , rac!1 „, ;aIcJ „ ^ pMS „ !](mJ Armm 

■ IPU , 3 C I S , , . ;iH , mM: ',\ ,„„ MlI|( , tc MA im a B,„cad HP, C ',„,,„, 

Pcrseptivc Biosvstems F^iminphim \f \ 

- ^nmuiam Ma, rractions were dined Irom die column with a 

inc.ir gradient rrorn ; )- 0f) u „ SuiTt- ,<> h- \ 

' Lt ,J l ■' l! A acetonimir The flow rate was )i) 

^ minute ar.d e:uem a.:, aunmorea it : ] x :n ^ : ^ ( , nrn 



& f — -*«t «** suspended ,„ PBS „ d , es , ed mJm 

n , ~,,, WOTCd 5u ,„ ca p]gs for ,„ dumoo • 

-,,on One was f0 „„ d m ,„ duce a ^ ^ ^ ™> 

fractionated tunher bv RP-HPF r nn , ■ u „ ^scquemlv 
v HPLC on a m,crobore Vydac C18 column (Cat No -18TP5M^ 

M « P ^ El^AppUed Biosystems Division Model m HPLC P _ 
■,, .. oaei 1 '- HFLC - fractions were eluted 

with a linear gradient from 5- j 00% buffer (0 OS-/ TP a ■ • , 

„ ■ r((,05/oTFAinaceto '"tnle)w 1 thaflowrateof80 
ul/mmute. Fluent was monitored at nm p- • 

" L nm - El ^ r tractIons w ere collected and tested for 

mdUCU r - * g u,nea p ig , One fraction was found t0 J 

™ S OTH ofabom I6 mm indurauon ^ ^ ^ ^ ^ o ^ 

rne positive fraction ur,c ,-,>^.„„. 1 . . 

"~ " M — -AOt g el etectropnoresis and found to contain a 
?r0lem D3nd " ^PP-unatelv I : W molecular wei«ht. 

This polypeptide, herein after referred to as DPPH, was sequenced from the 
amino terminal usino a Perlnn irr , ,• , 

t P=rk,„ n mcr . App i Kd Bl0svs[cms Division proa 

sequencer as aesenbed anove and ;ou„d r„ ha ve v ,, m , 

, "na .0 na\e the N-terminaJ sequence shown in SEQ ID 

- - Co mpanso „ ^ ^ „,„ ^ sequenMs ^ 

T rCVC " d " U ^ h<,ra0i — — — ^ents of OPPH .« 

~- — — - - — - -i: 

a raa,ca - *• D,,Dn ™ — - — ** . ~ _ 

witnm uie >/ luocrcwosis cosmic \I7v- lrr \ 

, „ *- ,u ~ ' Xn °P cn reading frame of v, 6 bp was 

^entitled. Hie full-lem-th nvo • p ^ 

lu.un DXA s eaue„„ ,„ r DPPD :s P r„v,d e „ ,„ SEQ ID NO ^ ..„„, 
•he -^osnono™ ^l!-ien^ a,n,„„ JCM MueM ^ ^ m ^ u) ^ ) ^ 
' he abiiuv oi' t( !c : w,,,,^ 

•.jimuiate Human I'BMC : ( i nroiiferate uk! 

v •• •°-^=-' :!'N-v was .tssaveu '■; ; 

;:xamnlc -■"••^■n in ranic •) :j|»;»n ,, s 

vmna :o -''Miniate proliferation ... „ r ., „ , 

T , t r reaction ,u, e .uantmes o, 11V, more :nan 

•iat uicitea ?\ commercial PPL) 
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TABLH j 



\ssays to nppn 



PBiVIC Donor 



Stimulator j Proliferation (CPM) 



Medium 



1,089 



PPD (commercial) 



8J94 



0.17 



1.29 



DPPD 



13,451 



Medium 



450 



PPD (commercial) 



0.09 



5.929 



0,184 



.26 
.40 















c 


j Medium 


54] 


j 0.1 1 






! PPD (commercial) 


8.907 


1 0.76 






\ DPPD | 


23,024 


>2.70 



EXAMPLE^ 

T 7 " sed " cons,raa - — ^ - u**. Z..V, esPression 

* ;l * x,vs * w — „™ ::™,„ m TO1 „ 

■ ■urcrj;j:oM\ _:enum:c DN.A expression 

' r ' r !:y '™ r * tnc P r °«^ acscru>cd above ,n hxample V ' 

-n, Cones we, puHne, The , e , nil]ned ^ for ^ ^ 

T" U ; aS M °- K M °- : - M ^ ^ MU-X MO-,. MO,,. MO-30. MO-.4 

:,w M<>-.> arc provided <y<j ;m n^,. . ; 

1 :eN;-'j;:vei\ . with :bc corresponding 
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P-edic^d mmo aHd SBqurace ^ prov|<Je(J _ n 

for rhe clone MO- is provlded ,„ SEQ [D ^ ' * ' **«"« 

Clones MO-,. MO-30 and MU-35 w cre fomd , 0 show , ^ 
. Ma s„ owd some homology , 0 . previous)y idemified - - > 

™ - •« — Mra; 37 mo, « fomd , 0 show some homoIo 7 

^ " «« MO-3. MO, md MO,7 wre f uu „ d 1 

„ snow a ^ de5ree „ ^ „ ^ ^ ^ ° * 

some homology to .1/ lubcrcu/n^ h,~„ -k . Miowea 
h . heat shock protcm 70. MO-27 was found to show some 

homology to .V/ rubcrcuiavs cosmtd MTCY3^ MQ-4 and VfO ^4 
some homni™ . . ^ ^ tound to show 

-6 and MO-.9 were found to be highly related to each other and 
to show some homoloev to 1/ ,uh,rr„i v , ,■ 

M „,, M , M0 din - vdro "P^ -^transferase. MO-28. 

WCrC I0Und " * :demi " ^ 10 - h — ' previous, 
; Pr0tCiR ^ - *- - some homologv l0 a 

pavoustytoenntted U kDa .U ^/o. vw heat , hock protetn. 

Funher studtes astn, :he above protocol resulted ,n the iso ,auon of an 
" ™ — <» - MO,, MO,» ;mJ M 0,o - 

' cs "" Tivti> ,; " rnr3as,>n — - i„ ^ m 



. , " — -^ ••'"niu,ou I cs I ,M<v;w .Vl„.;:. M() . riJnd \ lo .;o 

-UK, O , )m , „ :;:oiOL , lc , unKn((UIi _ . . 

iYV> ,, 7/(ls ., i . ' — •^•'ou.m;. .M.uied :rom 1/ 



A.WIPLH 
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™s example .litres isolation of DNA sequences encoding .V/ tuberculosir 
-■gens by sc.en.ng of a novel express.cn library w„h sera from ,/ ^™W, n f e cted 
patients that were shown to be unreacuve w,th a panel of the recombinant M. Miosis 
anugens TbRal I, TbRaS, Tb38-1, TbH4, TbF and 38 kD. 

Genomic DNA from M tuberculosa Erdman s^n was randomly sheared t0 
an average s*c of 2 kb. and b.un: ended with Klencw polvmerase. followed bv me addition 
of EcoRI adaptors. The insert was subsequently ligated into the Screen phace vector 
fNovagen. Madison. WI) and packaged in ruro us,ng the PageMaker extract (Nova.en) 
The resulting Hbrary was screened with sera from several .V/ tuberculosis donors ^ 

*been .shown to be nernnvp?^ ^,_i ... 

~ - w Ui h^-viuumv lacntmed M tuberculosis ant mens as 

described above in Example 3B. 

A total of- different clones were .dared. Bv comparison, screens of the > 
Zap horary described above usmg the same sera did not result ,n any P osn,ve hits. One of the 
clones uas found to represent TbRal I. described above. The determined 5" cDN A sequences 
I" ! ° ° f ±C rCma,mny 21 «'herctnafrer referred to as Erdsni. Erdsnl Erdsn4-Erdsnl0 
trdsm.-lS. E:dsn:!-Erdsn:j and F.rdsn25, are provided ,n SFQ ID NO- ^- 3 - 
respectively, with the determined 3" cDNA sequences for Frdsnl. ErdsnZ. Erdsn4 Erdsn-^ 
nrnsn- -ErosniO. Erdsni MS. ErdsnJI-ErdsnZ? and Erdsn25 be.nu provided i„ SFQ ID NO 
-•---I. respectively. The complete cDN'A ,nsen sequence tor me clone Erdsni ,s 
provided in SEO ID \ : f v < • 

Q ° U,mf,anson « ^e J«erm in ed cDNA sequences w,th .hose „ 

1,16 " Cne ^ reVeakd n ° SIUn ' flCMt homologies to the sequences nrov.ded , n SEQ ID NO- 

30 ( >. ^ 1 8-320. 322. ^4 — — - 

■ ' ' ■ :md --i: Hie sequences 01" Sri > 

.D \m- ;f )S ;iu_;: — . . . _ _ 

— - 1 Hi - : ;,s, ;4o :ind 

'■•vere round :o shuw ,on lL . -o;;^w 0 - - - , . In ;.., , 

* ( -■■ ^ MK.mw:; ! e-:uences -rewi^v \ie:uu"iea :n 

*'H^i.V\\7/i>.y/A. 



XAMPU- - 
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tuoercuiosis antigens. * 

^> S , S »,„ SS e™ froml0lbereo|os ,, lnfectcdindiv|drf ^^^^^ 
" S " V " * «* *• «*» «* -fences dam™, bv mass 

„„ SE0 m NO: 343. C„ mpanson of ^ ^ ^ „ 

reveled homoloev to rhf«Shn^ " 

ogy !5b praunor m „ gen previously |dmnfcd ^ ^ 

'» . app^ch. *. hieh molecalar ^ 0 f v/ , uhmulm , s 
culture supernatant was sturiieri tw _ 

'" u >' ^ J "^n immunodominant amiecfls which 

may be usem n the dia^no^ nt ic ■ 

— b w " 1157 ,aVail3ble ^ ^ - — — • CX, show 

rrrr t- markcr tor * -™ - — ind , Id .r 

- H-.W ^'detcrmme the .denmv of these annuens two- 

- < < - < Protem spots >n the high moiecular weight rea ,on were 
entitled. individualK- -Ncsed -r/v^n, ■. ■• ' 
' • ^ maUCa,1V ^ »»*«d to mass .neetrometnc- ' 



spots 



The determined amino ac:d seau»nr»« t ^ 
, , , . ^qu.mxs tor three oi ihese spots , referred to as 

- and 4, are prov.ded in SEQ ID SO: ^ u^a* and -4- . 

• .'ninamon or these sequences w,th :ru»«- .., .,,.„ • 

..10,, ..] uene oank revealed that soot = . s -he 
previously uienti'Ied PrV ' ■. ' ' ' 

.nosnnoer.oirv.-uvatc klnav , r; , e , u ,, ^ 



-or : were determined to r, rVum ,vo n Vu s , r ... , , . ' ~ 7" V " ^ ^ 
neat shocK rowans Sdoi 4 -, V z ^ . <M '- S xs 

, ro[c , n . r r T u " JUL ™ nCa ! ° be the ^ '-""fied .»/. 

protein ivjt G To the- ht*<< 

b " 1 ,mCm0rS — — PcK-i nor the two DN,vic< 
n * ne:,,C%,oush ^cn show:, ,,^ c . Jt!i:t ,. ;n . 
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USE OF RFPRFSEVTATI\ -KANTIGEN-5 



TWs example diu^tes te ct ,e cnve „ css of ^ n 
polypepttdes „ stlI , Ksts for ^ diagn(lsis of k , uWtei;t infcc 

_ MM** were injected intradermal with ,00 „ of e,d,er PBS „ r PBS plus 

of > mn, or greater bang cons.dered poslllvc Q( ^ ,„ ^ ^ 

negative and 18 were PPD positive ,)f rh ,. „„ n . 

P ot the 1>PD p 0s „,vc individuals. 3 had active 

.... ...... p.ev, uu ,,y mtccteo w„h tuberculosis and 9 were iiealthv 1„ , 

second stud, .3PPOp oslll ve,„ dl v, d ^were, sl edw„0 , ; „ p„ s „ 



TABLE 10 

RESULTS OF Dili TESTING WITH REPRESENTATIVE 



ANTIGENS 



PPD 



negative 



PPD positiv e 
neaithv 



nnor I B 



o/: 



5/9 
3/5 



TOTAL 



Postal PosHotal PlJ5/TotaI Pos/Total ! Pos/T||ta| 



IS 



4/9 



0 1 s 



: 4' ,( ) 



0/2 



6/9 



4/> 



4M 

T-i7]"s 



FX AMPLE <J 
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of *. pep„ te from , e M „ d suppon ma; , ^ I - 

pep,,., Foilowns lyopl „ ll2aMn rf fc pnfe ^ odtj - 

using dectrosprav ni^ spectrometry and h,- , ' aractcnzed 



amino acid onaivsis. 



FXAMPLEJO 

A fusion protein containing TbR-r, rh,- ^s? un 

nnu i DKa-, the j8kD antigen and Tb38-1 wa^ 

prepared <ls follows. 

Each of the DN'A constructs TbRaJ ^8 kD and Pb^S i 
. - -o kjj ana b.?8-I were modified bv PCR 

„ n , , , , J * ' UN A Uds used 10 PWorm I'CR usine the primers 

PDM-o4 and PDM-oS (SEO ID \T>- i j>> ■ • - „ Pnmer.s 

'he ON A ™ n iT respeenvciy. in each ease. 

A JmP,,fiCal, ° n W " ? — «■* '« Ptu nurfe, : „, H) mM j\jp,. , 

-» or :he PC, ^ a! ;„ u ,, w , nc _ s; ; ^ ' - " 

i^rr Tt ,_ : . r m , U1 ' * u U ' VA polymerase 

„ ciB „ % ,- ". . -" un " l0n " ~ ""'»™«' - : ».»• folio™, hv ,, 

c>cjls ot % J L tor > sec anci ^ u r r i 

- ana ( ror ; mm . M(J iast , y , y ^ for 4 

«r . mm. :ol!„wed hy 40 eveies or for ^() see ,S' 
( and -r, ■.»r, m!r .^ flft , 1|vhv . : . r , or|iasa i;iir ^ ; " ■ - 8 
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1.5 min. 30 cycles of %°C tor 15 sec, 64°C for 15 sec and 72°C for 1.5. and finally by 72°C 
for 4 mm. 

The TbRa3 PCR fragment was digested with Ndel and EcoRJ and cioned 
directly into pT7*L2 IL 1 vector using Ndel and EcoRI sites. The 38 kD PCR fragment was 
digested with Sse8387I. treated with T4 DNA polymerase to make blunt ends and then 
digested with EcoRJ for direct cloning into the pT7*L2Ra3-l vector which was digested with 
StuI and EcoRJ. The 38-1 PCR fragment was digested with Eco47III and EcoRJ and directly 
subcloned into pT7^L2Ra3/38kD-l 7 digested with the same enzymes. The whole fusion was 
then transferred to pET28b - using Ndel and EcoRI sites. The fios.on construct was 

The expression construct was transformed into BLR pLys S £ col: (Novagen. 
Madison. WI) and grown overnight m IB broth with kanamycin (30 un/ m |, 
chloramphenicol (34 ug/ml). This culture (12 ml) was used to inoculate 500 ml 2XYT with 
the same antibiotics and the culture was induced with IPTG at an OD560 of 0*44 to a final 
concentration of 1.2 mM. Four hours post-induction, the bacteria were harvested and 
somcated m 20 mM Ins (8.0,. 100 mM NaCl. 0.1% DOC. 20 ug/ml Leupeptm. 20 mM 
PMSr followed by centnfugation at 26.000 X g. The resulting pellet was resuspended m 8 M 
urea. 20 mM Ins ,8.0.. 100 mM NaC! and bound to Pro-bond n.ckel resin .Invitrogen. 
Carisbad. C.U The column was washed several times with the above buffer then eluted w,th 
:m imidazole gradient (50 mM. 100 mM. 500 mM imidazole was added to 8 M urea. 20 mM 
Ths (8.0). 100 mM NaCI). The eluates containing the protein of interest were then dialvzed 
against M) mM Tns (8 0) 

The DNA and amino acki sequences tor the resulting fusion proteni 
heremaner reterred to xs Ihiia5- = 8 k;V IVX-;, -rmuhti :n M ; <j ID No- ;52 and :53. 
respectively 

A fusion protein containing the two antigens fbiM and Tb.'8-i (hereinafter 
referred to as TbH9-fb38-h wnho.it hinge sequence, wis prepared us.nn a similar 
procedures -hat desenbed above The DN v -.euuenee for the rbH'MWX-: lus.on protein :s 
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The ab.hry of the fusion protnn TbH9-Tb38-l , 0 induce T cell proliferation 
and IFN-v pr „duc tJ on in PBMC preparations was exammed using the protocol desenbed 
above m Example 1 . PBMC from three donor, were employed: one who had been prev.ouslv 
shown to respond to TbH9 but not TT,38-I (donor 131); one who had been shown to respond 
to 71,38-1 but not TT>H9 (donor .84); and one who had been shown to respond to both 
anugens (donor 201). T.e results of these studies (Figs. 5-7. redely) demonstrate ^ 
fiincuonaJ actum- of both the antigens m the nos.on prote.n. 

A fusion protein containing TbRaJ. the antigen 38kD. Tb38-1 and DPEP was 
prepared as follows. 

Each of the DN'A constructs TbRaJ. 38 kD and Tb38-I were modified bv PCR 
and cloned mto vectors essentially as described above. w,th the primers PDM-69 ,SEQ ID 
NO: 150 and PDM-83 (SEQ [D NO: 205) being used for amplification of the T,38-,A 
fragment. Tb38-1 A differs from Tb38-1 by a Oral s.te at the y end of the cod,n S re.o* that 
,ee P s the nnal am.no acd .ntnet while creating a blunt restriction sue that is in frame. The 
i 0Ra3 38kD-Tb3S-l A fusion was then transferred to P ET28b us.ng Ndel and EcoRl sites. 

DPEP DNA was used to perform PCR using the pnmers PDM-84 and PDM 
•S, - SHO ID NO: 206 and 20~. respectively, Md ; u] DN'A at 50 ng,ul. Denaturation at 94 X 
was perrormed tor 2 m,n. followed bv 1 0 cycles o. % C for ! 5 sec. 68 X for 1 < « ^ ~ 
C :or i.5 m,n: 30 cycles of o„ C for 15* sec. ,4 t tor ,5 see and "2 C tor r 5 IIun; and 

:ma,1> 2 X 4 mn - nPEP PCR ^ument was u.gesteci w„h EeoRI and Eco72I 

and clones directlv into the ijET^SR-^ lSkrviK l \ - 

^ j8UJ ' jK ' 1 A -unstmct wnich was digested with Dral 

iiC ° R1 r ' 1C fi,SIOn M ™™ *™ confirmed :o * correct in DNA seuuencme. 

AL'^iHiininarn protein ws ^r'^-u"" . 

^ ^ - escnoea »°<^ . nc !)\A ana amino ac:u ncuucp.cs 

:he resulting r'asion :)mte:r < >r- . , . 

— - .o a> : r>»- _ ; arc proviueu >LO j[) \(j 

- n S and 20Q respectively 

The reactivity ,nhe fusion prote.n Tbl -2 w,th sen, .rum M. !u hercui<„„- 
■ntected patients was examined bv F.L'SA u.s,ng the protocol described above. The results „ 

:nese studies ■ Table ! ! < Jnc-s:-,-.. - , f ,11 r 

" J 1 1L ,our ^tieen:; luncrion mdenendentU :n the 
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Inactivity ok rsF-2 Fl'siov Reowbinant w 



ith TB and Normal Sera 
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A fusion protein containing TT,Ra3. the antigen 38kD, Tb38-I and TbH4 was 
prepared as follows. 

Genomic U DNA used ,„ PCR M ., en£th ^ R 

-Kb the pnmers PDM . I57 _ pDM ., 60 (SEQ [D ^ ^ ^ 

DNA - .00 n^,. Denaturalion „ « - C » perforrned for 2 mi , ^ bv „ ^ * 

for ,0 sec. 6, X ft, 20 sec Md 7 ; - c for 5 mK m , My ^ ^ ? ; 

0 rn , T* FI . TbH4 PCR rr^cn, m ^ ^ ^ ^ ^ , ^ ^ 
B,o, abs ) ^ d oned dte|y , mo fc pET:SRa3 , 8kD/38 , A consnia ^ 

»h.cb ™ ^ w ,„ M Md Ecoftt TV n„ 0 „ construe! was confirmed ,„ * correc, 
" " protcn was prepared as described above The DNA and 
-no ac.d fences tor the resuiung ton protetn (herein, referred l0 as TbF-6) are 
Provided in SEQ ID NO: 150 and 351, respectively 

A fus.cn prote:n containing the antigen 38kD and DPEP separated bv a linker 

was prepared as follows. ' £ 

38 ^ ° NA ^ USCd 10 Perl0m P ™ -ng the pnmers PDM-1 76 and PDM- 
, <ShQ ID NO: and 353. respecnvelv,. and 1 ul PET28Ra3 38kD/38-l/Ra2 A -P DNA 
-HO n g , u l. Denaturation at 96 T was penormed rbr 2 m,, followed bv 40 cvcles of % C 
:oi 30 sec. "I T lor ' - -e- -m,< "' r- .• - ' ' 

■ - onu . C tor : m,n ana af) ,ec; ana riaaiiy bv anneaiinu at — C 
■or , mm . n le nv, of p nmers PDM-Pi. P DM . r: . an . , DX , r? . p^, ^ 

r a,Cd heaUnS 10 05 C ^ 2 ™ ^ *™ - C slo,,v at o , c- SK 

JPEP DNA was ^ cd 10 Perform PCR as aesenbed ,bove The ^ i-n 

. ,louu - ln< - '« tragmcnt was digested 

•■vuh Eco RI (New Enc'and B.ohho ■,„ i i • ™uskq 

■ ° iabS> c,onea :nw ™odificu PT-AL2 vector wh.en was 
-romega, and Ecu Ri. Hk- rnoauied pHAl.: construct was tole ned t 



" ' ' P ' wta; J "" ^ ^ moauied nHAI..^ .•nn«n„, 

leu •(> 

"■ :nOTe ,lLsi • ^ ! sue. Ine 

-neatea ,ets of phosphoniated pnme. .PDM-; - P,,M. r: PDM . ;73 pr)M ., 74i 

were cloned in The DPFP £>rr? ^ 

PE ' ' CR iragmcm W ' 1S Feo RI and Eco - , md d(med 

into this second jonstruc* 'v'^^-- ; ■ . 

, ■ ^*.cr.M, f , 1 , J -,,, Al , !;aptoi>|fltoi ^ 

f'C:i K ! ; •:> 1 r irin i 
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modified PET28 vector. Tie fusion construct was confirmed to be correct by DNA 
sequencing. 

Recombinant protein was prepared essentially as described above. The DNA 
and ammo acid sequences for the resulting fusion protein (hereinafter referred to as TbF-8) 
are provided in SEQ ID NO: 354 and 355, respectively. 

One of skill in the art will appreciate tnat the order of the individual antigens 
within the fus,on protein may be changed and that comparable activity would be expected 
provided each of the epitopes ,s still functionally available. In addition, truncated forms of 
the protems contaimng active epitopes may be used m the construction of fusion proteins. 

From the foregoing, it will be appreciated that, although specific embodiments 
of the invention have been described herein for the purpose of illustration. v; 
mod.ficat.ons may be made without deviating from the spirit and scope of the invention 
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SEQUENCE LISTING 



GENERAL INTORMAT I C N : 

(i) APPLICANTS: Reed, Steven G. 

Skeiky, Yasir A.W. 
Dillon, Davin C. 
Campos -Neto, Antonio 
Houghton, Raymond 
Vedvick, Thomas S. 
Tvardzik, Daniel R. 
Lodes, Michael J. 
Hendrickscn, Ronald 

TI — ^invention- compounds and methods for immunotherapy 

AND DIAGNOSIS 3 F TUBERCULOSES 
ill; NUMBER OF SEQUENCES: 355 

iiv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE. SEED and BERRY LLP 

(3) STREET: 6300 Columbia Center, 701 Fifth Avenue 

tC! CITY: Seattle ^Jfo 

(D) STATE ; Washington *■ 

(E) COUNTRY: USA 

( F) ZIP : 9SIC4 ■ "032 

!v : COMPUTER READABLE FORM : 

.'A> MEDIUM TYPE. FIodcv disk 

. C) OPERATING SYSTEM: PC - DCS / MS - DOS 

:D) SOFTWARE: 3 H c.-r^~- -<^^c^ — ^ 

-=k^._t„^^„ .-.e lease 4 . . ,; , /err. ;cn -I.JO 



TORRENT APPLICATION DATA: 
fA) APPLICATION NUMBER : 
IB) FIL I NG DATE : J S - MAY ; j 3 B 
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(D) TOPOLOGY- linear 
ixi) SEQUENCE DESCRIPTION- SEQ ID NO 1; 
CGAGGCACCG GTAGTTTGAA OCAAACGCAC AATCGACGGG 



~ - " ^- ^ * wvrt i. '._*jI_Ij\jE- 1L 

ATGGAGTTGC TGCAGGCCGC AGGGNAACTG ATTGGCGC 
GCTACGCCGC CCGCCTGGTG ACGCGTCCAT GTCGAAC^ 
GTMTGCGCAG GGNCGCACGC ACCGCCCGGT GCAAGCCG 

gncaccag:;g ancaccccon nntcg:tc:;nt tc~cgnt^ 
2) information ?cr sec :o no:2. 

ii- SKC T .rEN"CE CHARACTERISTICS - 

(A' LENGTH: 7b2 base cairs 
(3- TYPE: nucleic acid 
;o: STRANDEDNESS : single 
TOPOLOGY: linear 



CAAACGAACG 


GAAGAACACA 


50 


CCGCGGCTGC 


AATCGGCGC 0 


120 


TCGTATACCA 


GATGCAGCCG 


180 


CCCCTGACGT 


CCCGACCGCC 


240 


ACGTGTCGTT 


TGCGAACAAG 


300 


GCATCGCCGA 


CCACAAGCTG 


360 


GCGTGACGAA 


CATCCAGCCG 


420 


GTCCGAAGCT 


CTCGTCGCCG 


480 


TGCTGTCACG 


CGCAT CGGCG 


540 


CGGNTTCAGC 


CCGCTGTTCA 


600 


CGCGCGTGTA 


GCACGGTGCG 


660 


CTCGAGATAG 


GTGGTGNCTC 


720 


GNAT G A 




"* 6 



3EC r CENCE DESCRIPTION; 



™™ ZZ^tl^l AAGATTCGAT TCTTGGGAG S 2TTGAGGGT: 



6C 
12 0 
18C 
240 



AT0CAT0A0C ATCACCAT2A COA^GAACV^- ~~„^™,^„„„, rt 

^,^ PW , JMrto*. .^Luuwvjn^n _ -irt.C CTC CGT CTTCGG^GCA 

^r;;:;;^ GC3AGCTGG ~ coot—cog oaagcgsgta cggagagcgc ggtctccggg 

:::lTSf G — GG —TTG OTGGTAGTOA AACGAGGCCO CAACGCCSGO 

;" G I GGA ^; A AG ~ GA ;f ACJ rcGGCT ™ :: --catcccga oagcgacata 

S.l^l'^Z^l -~ rtGt — '-"^0 CATGCTGAAT TOCGGTTGGA aaacaacgaa * fco 

-.■-.-vrwo^^ -^^^TCGS SAGTCTCAAC SGCACC~\~- ^A^r— 

;: ^.oo.o. mACGG ogacgaggtc oagatcg oca asctccggtt ggtgttc— - 

^^ CGCGA GGATGACGGG AGTACCGGGG "CCCTCACG GCACCCGATA 

"ITZT.^^ — -GATS TCGATCGGGG oggtcotccs asctgctaco accggatttt 



480 
S4 0 

NGGGTGACCO 600 
SCGNCCCOGN 6 ^ 



j-J^j-\ '-..-...« ^ . ^ , „ _ . J .. ...;[T!< jAAAN GGTNA 



STRANDEDNESS 

T'S PCL0GY 1 : : 
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INFORMATION FOR SEC ID NO : 4 : 

SEQGEUCE CHARACTERISTICS: 
A, :*EWGTH: 44"^ base oairs 
! 3.) -Y?H: nucleic ac:H 
■ C: rTKANDSCNESS : 3ir.Qi- 
■■ —> ■ . ^ r'C L>GG V : - inedr 

>:i SEO/JENCE DESCRIPTION SEQ ID NO: 



-^vj-AT'jAAC ACGGCC3CCT CCGATAA^"^ ^ w 

^ ____ , v^G^G_ .AGGGTGGGC AGGGATTCGC 
— '^..AG rt ^ CGATCGGGTG GGGGGTCACC 



II TCCGATCAAC TCGGCCACCG CGATGGCGGA 
catctcggtg AACTGGCAAA CCAAGTCGGG 



, ^-^^o^^ ^^oGCCTGAT TTCGTCGYG 

-vju-^jGCC aa . .gg a 



TV RE : - u - ^ _^ . , „ 1; 
CICVVIJEEDKESS : 



3 „. , 

AA7G3C; 



^ -T'SNATTCC 



J CAAT TTGGCGGGG I' 



i n-G..CTGGT GGTGNTAGGT GNCTG.VCTGG 

ina^; Jl '^ AA;;: " :3AT - ; cgaggaacag ggtgtnc~o- 

- "" ;J - -U-IA.-.-AGNCJ nttgatgnga 



4C 



GCGGCCGGCG GT3CTGCAAA CTACTCCCGG AGGAA"-— ir«T r -. r „ 

ATGCTGGTCA CGC-G— ACTG..GCA7 caauatcttc 

AC _ Ac ^;r: — — tgttcgggtg tggccacggc cgcgcccaag 

^:;^;r:: ?^^- GAA A — - c "at accggccagg cgtgccagat tcaaatg-c- 

GAAAA^-ACA „ ^--^ ^v-AG^A^T ACCCCGACCA GAAGTCGCTG 

gcgcgacaa g TTCCTCAGCG cggccacatc gtccac--^ 
cg^acg" ^cacc tcggccacat accagtccgc Sta^ScS 

ASriS ™^ CG CCGGCGOCAC GCACCCAACG 

CTGTGGCAGG cS^.£^.~S ™^ =A CG 

ssss s~ — ™ -™ =1 



6 0 



:acga gtccaac^ ^!3 GGGT G — tcgaca acaacggcaa lao 

^ ^oGCG GCAAGTCTCG GCA^"— . n 

-^/wo*o ATCACCGCGG TO GAGGC"""^" — * 240 

300 

420 
44" 



Id J 



vTCGAGCTAG TGAGCCCAAA jo. 
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ii) SEQUENCE CHARACTERISTICS : 

iA. 1 LENGTH: 533 bast! pairs 
■J 3) TYPE: nucleic acid 
(C; 3TRANDEDNESS : single 
CD) TOPOLOGY: linear 

I XI) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

"SaS ^ ACCTCA ctaaag ^ caaaag™ agctccaccg 
: g.tctagaa ctagtgka™ yyyckggctg cagsaatycg gyacgagcat 

C^cSI ^ r CGGTGA TCGAArcACC ^ACATCC TGcStC^ 
c£££2i — ^ CCT ^CGGCCG CAGTCCCGYA acgcgctctc 
^™ ;:;:r CGGY crrccYCGAC gccgaggycg acgacgacat 

::r":r TCC ^ttctgc gccggactk acctcaaggt 
;;: C ~^ ACA -^ccg CG gtgggcggcc atgaccaagc 

, ^Lu^ uu -jCTCGAACTG gcgctgtact 

gcttcgncg; 



.CCCAr^r- 



Jrtu-_i,-3crr gc 

jouACTC AGTG"" ~i""^f"" t> , . ^, ^ „ _ 

1 - j^oACCGA C G C 

INFORMATION' FOR SEQ ID NO 7 : 



CGGTGGCGGC 
TAGGACAGTC 
CACCGACGAA 
GGCGGCGCTA 
CGACGTCGTC 
AGCTGGCCGG 
CGGTGATCGG 
GCGACATCCT 
TGCTGCCCAC 
* GG ATGAG 



6C 

120 
180 
240 

300 
360 
420 
48 0 
54 C 



(l) SEQUENCE CHARACTERISTICS : 

= A; LENGTH: 1362 base oairs 
— - nucleic acid 

■G: CTrANDEEIJESS ■ sincrle 
. TOPOLOGY linear 

;X: ' SEQUENCE DESCRIPTION: SEQ ID NO : 7 . 



3 ^ ^>r\U 



---GCTGCOOO AGJCGOT 



-Jv_ . — >_G 



,A " " G ^r^ GAG AGCrjGGCGr " AACGGCGATC GACG 

JAG G AG G GAG CCGAATCATG AAATTTGTCA AC "A 

-rw jCTGT CC I~3GACGAGG GAITGCTCAC 

3CTGGTGG jC CAGGTGCCSC GTCGCCGCAA 



- JJ -^ :CTGCGCTCC 



jTACGCGG ..AGGCCAAAC IGACACCOI 



:a 



"AC 



jo a. GAC^ ..^o^CGCC gtatgtggcg -gggcggc— ~- , r ^™,. 



■ wvn „ .-v. • . 

- -i. j3GCA/\ CA 

j.-^ — \ _ jC g c cocac 

^ ■ -ACGGCG AGCC^ 

■~ - - ~i ^ _ jA c c tg c a c 

jTGACCGACG acgac 



cacccgtc: 



JGac.ggt::t tcgcccggaa 

-~ i/\GCC3J iA-YCOCTCCC 

■^----J- ^ _ j; '--Jw.^^lo 

- J GTGGACGA ACGAGCACAC 
- "~ . jCCCTGC'GA •-- r ^.^, — ^, 

^ JJ "' * — --v_ ^ — — GAT GC 



- GG C CAGAG T 

CGAGTTCGGC 
v- vjGCTGG 
GGAAG CCGTC 
CACCACCATG 
AGCACCTGCC 
ACCGGCGGGA 
GGTGCAATT^ 

^-tvjVjvjoCCOG 

-j-vj - GCGCGOG 
CGACGATCTG 
CCACCACCTC 
- - ' «. GvrGGTC .7 
CGCCGAGCTG 
CCCGCATCAG 
GGCGCTGGTT 
GATCGGCGCC 



36 0 
420 
4 0 0 



: 9 " 



CAGGT 
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■ : - -SEQUENCE CHARACTER I 3 TICS : 

(Ai LENGTH; 1458 base oairs 
iB) TYPE : nucleic acid' 
■:c,' ST3ANDEDNESS : single 
fD) TOPOLOGY; linear 

(xi) SEQUENCE DESCRIPTION- SEQ ID NO : 8 ; 

oS?SS SEES? £~ Soccer ggccgaggga cxggggcgcg 

TGGATGACGT GGCCCG^ ££%£L C GAAGCCGGTC 

CTAAGGCCTT GCTCGGCGTG SSaCg1~ G ^GCGGAG CTGCGGACCG 

TGCGCGAGCG CTATCTGCTG CaSacSgc ™GGGGC GTGACGGTAC 

TGATGGACGG ATCGGCGCGC ~GTGT— ~~ tf^^ GGC ~AGTCG ACCGGCGAGC 
CGAGGCGGTG GGCCGAGGGG ^C~C^C~ " '-^^TATGAG CCGGG CTCOT 

CGcrc SI ^5;~ 

GGAGG CGGCACC3GA ~— -~~1^ "" C2CT ~ G - ACAGGCrGCC GAGCT3C \G ~ ^ 
--TCCAGGGG CGGGAC3GCC AGC~AC~~~ ™ C=7SCS ACCCGC ~GG GATCGGGTG 
-3GGTGTGGT CTCCATGGGC GGT~GGC~~ crr-r^^ ACGG CTGTA7 GACAGTGC: 
^ACCGGGA TA7CTGTGA? 



60 
120 
180 
240 
300 
360 
42C 



~ "JAGG A 

~ r ' u " , " 1 — ^ w ^ ^ — j 1 -^ uAu CTGC \G - • r- 

600 
6u 0 
720 
730 
840 
90 0 



CGACCGGGA TA7G7G7GA7 ^CACC^ TA7GGC7GTG CTTGATGT'J 

A777CAAGC7 ATCGG7TGG7 GTGACCGA- ATCC ^^C GAGCTCCCG 

TACACCGGCT GGTGAATGGG CGAA— ~~ ."X^ GGCCGTC GAA CGCAACGGC 

7G77G3AGGC CATCTGCAAA 3~"~ ^T^^ GC3GAT3C =- GCGGCCGAG 

^AGGGCAAAC GGGG7GCCG3 GGAGAGGC~3 ^T^ — 

.»-™„^7 CCCAC7GCTG "77AG3AG~ -vr-i-a-— X""" ACCAACCCG7 1 C2 0 

3GATGCTC3 C CGACGG7CG" G~~^~ ^I^- " — G ^CGATC AACCTCGCCC * ' 
7GCGG7TCC7 7GATGAC37G ATCGATGT^ \ GGTGTGGGGG 
-GGGGG3C3C GACCC3CAAG ATC3GGCTGG GA^™ ^ZV™ ^GGTGAGG 
CAG7GGG-A - — " " ""„^ - 77 3G 7GGAA C7Gr— 

VTAGCCGG ~~,~,-.~Z Zt^^ ~^c£ 



108C 
1140 
120C 
1260 



^ " v -' - - ->■ ^GCGCC TjCAC ^ ""^^ ~ - . , ^ ~~ ~ - - .-\ + G C G T C " i^>^ 

^ -^--j^-.-i .... JGAO"-"— — - ^ . ---- 

r — - 13 3 0 

GGGCA — ~ — --AACGCA CAGGT 



:N::r:-:at:;:; for sec :: 



14 4. 
14 -~ 



MO ■ 



sequence c:iaracter:gt::-j- 

■A. LENGTIi: 2G2 - cls ^":^^ 
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GGACCGGGAT GGGA^CGGC- — _ 

c™.^ SSSS; " ora "« 

ATCAGCCGCC GCC3C~3C"~ — ~-. r ~^„ 2, ~ ^-GCT GCC^/TTCCG TTCATCCTGA 

ccACTcrcc ccc3c^c :2x^ GCGCCTCCGC 790 

GACAACCCCT CG,— rr^ ^ -^^i -oCTGA TGGTCGATTC CGCCAGCTGT 



(2) 



in^crvat: — kor seq id nc-; : 

i: SEQUENCE L'HARACrERISTTCS : 

A. LENGTH: -oc 3a se oa:r: 
3 ^c,,:: acid* 

-GPOLCGY; linear 



SEQUENCE DESCRU 




riON. SEQ - .NO: II: 




660 
"20 



GACAACCCCT CGCCTCGTGC CG ^ — -CTGA TGGTCGATTC CGCCAGCTGT 840 

862 

INFORMATION FOR SEQ ID NO:10: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 622 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 10: 
TTGATCAGCA CCGGCAAGGC GTCACATGC~ — ~ „ _ 

.A.ACCCCGG GCGCCAAG AT "GTCGAAGTA ^TCGC~™ ^~ZZT *C 
GTGCCGAAGG GCGT~GT~~~ - — -*GCTGCCSC : GAACCCTGGA 

TTGGTTGCCG CCGTGCKTC CA^GCCO ^"cS^ S^™^ CGCG °ACGCG 

::;;:r" CG gtagg -cac agtgcaagtc a-ct^a ;™~T:r: ~ CAGGAT 



--_..„CSCA GTGTTCAAAG — ' — ^^ TCGGCA A " GCGG ~GCA GTGATGAAGG JO n 
TGGTGGTTGG CCGGGCACT^ GTC~T~ CATGGAA ^G CGTGCGGAGT 

ACCACAGCGG GCCGC-GTr AC^C ^GGCGCAC GGCGATGAAG 

TGGTGGCGG7 GTCGGCCGAC GAGC~ - — lZ^ CZAGGC C ^ GT ™GTT GTCGACGGCG 
3CGGGGTOGA CCTGG~GG~ GAAATGC GCTGAACACA GCGGTGATCG 

CGGAAGCCAC C C GN G A CATT ^ ' G 3 A C C GGNGT GACGNCTCGC GATGTCACCC 



360 
420 

54 0 

s oc 
si: 



^V,.,^ JVJ ' o^A.. .3 CGGCGG- .AAG 

"ACoTGC SATCG"*Gr~~ » „ ^ ^^"^ — w -j^^o^ 

---"a^aca acgcc-^c-g 

33CCC3A? ^l:2;:-f AT0GGAC"J 3i" 

.rt^.-vnA.ji agggcgt: 



4 a : 



GACGGAC CCACT-C""- — - " WM *^ CG7DAG 

-CCAAG CCCTcil^- r^ GrACCA —^GTGTG GAATGATCCA 

^CGACA ACTCCG3TAC S^™^ "t^"™ GACCG/tTTAG CGTTATCTTC 

SC3TGOG GCAAAG3CG- -AG — JCk- "G CT CG AC GG TGTATCCAAC 

^a:aac:; gaacgtl™- ; gcgtcgccgt cggcgccag- 
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SSSS SS ~~ — - GGCCCwGTG 

===tc.c^ rm*a ££££ ~S "S;= c ;f 

C ^ L ^° -C^AGGCGG GATGGGCGAG 

INFORMATION FOR SEQ ID NO-12- 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1155 base pairs 

(B) TYPE • nucleic acid 
fCJ STRANDEDNESS : single 
(D) TOPOLOGY . linear 



i xi ; 




iEooEMcs mam,. SE0 m Xl:12 

57 SSI? SSS J-—. 

ACATC^r— Gc — ^ GTTCGACAAG ACCGGGCATC 




-re-fAT—J ? CR SE- ID :;c.I3: 

- oE-usnce :haracterist:cs: 
A le?:gtH: ; — .. 



10 
12C 



AGCGGTGGC- AC'JTCACCCG GCTCAAGGTC ACCGTCGACG 

IliilllsII = 

CGACGACGTC ACC^CGA^ ^C~C~ CG ^TTCC 
GGGGTTCAGC 3C"-ACGG~ 'GTCAAGGCG ATGTCCTACG 

GCAGATGGAC G~^~ I'lf ^ TCCACCSAGG 
-3AGCGGGC- ^GTGCGC GACTACCTGC 

GGAGCTGGAC AGCAGC J~ ^^Z^ '^ACGGTGCTG GGCCGTCGCC 
^'^'^ -^^GTGGG GT.Ar^r- 



420 
48G 
540 
600 
660 

^cca= Kam : : =Ac r;;:; 

,~ ~ r= ^ - 

TAC2G3 TTCGACG^" — ~-~- Ar _ 1^ " — ^-^CGACA 960 

GGCGGCGCAC TGAGTG"""""" ^™1^ T 3TC3GTGGGC TACGGCCGCA 1020 

Jl - J — ^--T GGGGCGGGAA — — — - — 

'C33C3CAAT TGGGACCCAG — ."^^^ 10RO 



WO 99/421P6 



PC I 0326S 



78 



■ - j i~GoAAr 

^L™3GCGA CGGGGJAGCGG GGuaatggcG :?GAG7GAGGA 3GCGGG 



^^GTATCG G7CCATGCC7 
-jo-oGAAT 77GGCGGGGC 



-G77GTGGCT ATG^GGA^- -v— ' — — ^.AGGx GCCTGCCTGG 

TGAGGGGGAC 3GCG777GA^ r^GC™ — —I^?! GGAGGAACAG 3GTGT7CCCG 
'AAAAGGGT7 3ACGAGCG7'J \V^A~ — ~ -l^lT^^ 7 C7kCGCAGTC ^ CTTG A TG C G A 
"GGG7ATTAC CAG7GCG~A7 JT^3A^^-^ ^-72?^^ ----AAAGTG GACAGCT7GG 
-CACCTACAA G3ACGAGCAG GG7S-"~'~™ — ^I^ TC " JCT?0C::;G ~^ AAGGGCGTAT 
^\G7G77CGA CGAC7GGAG ^ AA ™ — ~™ ^^1!^ AGGCGACAA ^ A7C7CGG7GA 
7GGA7CCTGC CGC7GGGG7 ; A"^™^ Z^Zl^'^ ACT ™^AC7 7CACGCG7G7 
-7A~GGAAG7 GA7AGACGGA ATTTC^A-" I ^^Tllfl GACGAAC -~ 7AAGCGCAAG -g ( 

3-1 ^ 



660 
720 
80 



jACGATGACG CCCCCTCCTG GGA7GG77CG CCAACG— - rrrr-^~~* 
CGGCGCGGTG ACGA7AGCGG 7GG7G- C — CG ^ G ^ GG ^ 7G7TGGCCA7 

CGGGTTCAAC CGGGCACC^G C S^v :r^ TCGGC GGCGCGGGGG CATCCCTCGT 
AAGCATCCCC GCAG CAAACA SSSS rC^^ GTGGCTGGGA GCGCGGCGCC 
OCCCAflWTC G7CATGT7GG ^Ic^C ^ 

GCCTCCCCTG GGCAG7CCGC rccrw^ ^ACAACCAC GTGATCGCGG CGGCCGCCAA 960 
ACCCTTCCG GTGGTGGGGG cSj £££££ 

CGTCTCCGGG CTCACCCCGA TCTCCCfGGG GCCGTCGTCC GTGTTCAGGG 1080 
GGTGCTGGCG ATCGGGTCGC CGCTcSS SSSSS SSSS? 

CGCTCTCAAC CGTCCAGTGT CGACGACCGG n™2^ GTGACCACGG GGATCGTCAG 1200 

CGCCATTCAG ACCGACGC~G oSS^ S^***"* ******** CCGTGCTGGA 126Q 

GAACGCTCAA CTCGTCGgIg T^Sc GG0GG0G «* ^T^CAT ^ 

TGCGCAGAGC GGCTCGATCG £££££ tSSS c'SSS 1380 

CGCCGACGAG TTGATCAGCA CCGGCAAG"- GTCGACCAGG CCAAGCGCAT 1440 

CAATGACAAA GACAC r^S^ ACATGCC TCCCTGGGTG TGCAGGTGAC 1S00 

~ n ,™.„. --I^:™ ^-C^GAT CGTCGAAGTA GTGGCCGGTG (mr~nr~.~ 

-GC3GACGCC TTGGT~GC~S '-----I ::^ AA3GTC -ACGACCGCC CGATCAACAG - s , 0 
- TTT CAGGAT CCCTC3GGC3 OTMCCGC^C aS^S- '™~~ G TDGC3CTAA C : S30 
GTGATGAAGG TCGCCSCGCA GTGTTCAAAG C " ' " C — =~=A AGGCGGAGCA ;;, 0 

! = j INFORMATION FOR SE 0 ID NO: 14 : 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH; 1058 base~oa'- a 
t'E) T^-?E: nucle.c acid' 
:C; STRANDEDNESS : 

v ^ ' 70 PGLOGV 1 ine^ ^ 

^^.PT.CW: ID NO: 14; 

~* — AC^GCG G7GGCGGG^G ^^^^^ ^ > , „^ . 

AGGA7GC GACG7GGCAG G7T— ^ - - — ^.""^ "Zl .^AATTCGGC 



- _~GCGACGG CGAGCGC'GG AT ^^.r"^-— - 
-^CAATCAA CCTCCATTCCJ GCCTGCGGGC 

TSACGTCC3 ,-~G~™~ ^^Z^r '^ tji - AAA 3 00 



GAGGAGGC CGGCAG7CA7 GG^^Arrn^- 

^_GG GCC7GCGGGC — A7^ AC , - C ^ GCG — Z40 

A7GATGG AAAACGGGCG ^ ^^^uo.AG 7GAGCGCAAA 



360 



J 7 J7GAAGA7 3CTTGAT"r7 JG- ^C-AA^' ^GATG CGCGCGAGC7 

aGGACGGCTG 3CAC^^ ' — - ^,-^r^„^ ^^^A.^GGC GG JGACG07G 7GGAT7GCC~ 

-^Q^AGTZ GAAA7GGAA- 3AAC3CG7"^ — -^-C ".^^7"°^ — ^ A ^CAGC 

'AAGGG7G77 AAT3GNA" *" * i-- 



WO 99/42<r<> 



79 



(C! STRANDEDNESS : single 
!2! TOPOLOGY: Linear 



!xi) SEQUENCE DESCRIPTION: 3EQ ID NO:15: 
GAATTCGGCA CGAGAGGTGA TCGACATra- owr,,--. 

GGCGGCGGAG GCGGTCCAGC <^£S£* CCCACA TCCT GGGAACAGGC 

CATTGAGCAG GACMgS^ ctcA^r I AGCGTCGAT ^CATCCGCG TCGCTCGGGT 
AGTGTCGTTC ™ CGGCAAGATC ACCTACCGCA TCAAGCTCGA 

AAATCGCACG GTTTgSg^ J^f^ GCAAGACGCA 

GCGCGGCCCA GGTCC-r^ 7Z' *~ ATTTTGTGTC TGCTCGCCGA GGCCTACCAG 

CCGgSaI SScGCg' ScC^ CA0GC6re « TCGC^TTCC GGCGGCCA^G 
CC3TGGCCAG CcSc^TG ^S?*™ ^SG^CC TGATCGATGA 

AGCGTCCGTA GCCGGeS GC ™^ 
GG CTGC^GCGl. CCTCAGTGCG GCCAGCGAGC 



I^r CRMATZON ?CR 3EQ T :iG ■ ' 6 ■ 

jSQ'JENCZ CHARACTER 1 5 TI C3 - 
>A> LENGTH: 913 base^oa- s 

TYPE: nucleic acid 
C ; S TRANDEDNH 33. single 
(O! TOPOLOGY * - nea . 

' #• 

X: ^UENCE 3ESC?.:?TIOK: SEC ID NC : IS : 



- ^ O -_ 



CACCATCACC GG~~^— — ~~«~irZ^I 7 -CCGCCG?C GCCGATCAGC TGCGCATC: 



--ou^ACCGG TGGTGGC: 



60 
120 
180 
240 
300 
360 
420 
480 
540 
542 



— "C3CC3GC3L' CGCCATTGC" A~AC— ~ --S^ ATGCCACGG - 120 

-.^■^CCA CC3TC3C— —v.™! AiAC ^~ CoCCGGGGG CACCGTTACC I3 0 

-AACCCCGCC GCCGGCACGG TTGCC^— ^Z^^ ^ CCGAAT GAAC ~GCGC ; 4 . 0 

L'GGAACAGGG AMGCACTGTT JC~GC^G~~ --^X^"^" CCGGCCGGCG CCGGGAATTG i OZ 

3CT3CGGGA3 ZGGC^TTAC — ^^™" GT TAACGGCGC ' r GCCGGGCGCC 3 6-: 

3TTTGGGGGG AA7ATT"GG ~ — ^C^r^" ""SCGCTTAC CGGCGCCGTG , 2: 

--^oAAACA ACAGCCCAAC 3.3— — — " J ~^-*<-^- -G^o^GG _; S0 
T'lACCGCCAG CACC3CC3TT AA7G^~ CACC3GCCAT 
-CGGGCGCC3 GAGNGC3TGC -^itZ ^ C ~~>-C GGCCAGCGCG GCCCCTATTG 

CGGCVGCGCr GGACCCAC-G ^-----ft ^Z^" —AAAAGCCCG GGGTTGC CAC , 6C 



^ ^ a3(i oalr: 

1 type. - acid 



\U> <><> -420~(> 



PC T 1'S9 1 >,(I32()S 



80 



G7CA7GG7TG CTGAGC^^GC <"~~~~- 

GGCGGCCCC3 CGGGCC77G7 CGcIcGAr- —Jf^^ ^^CSG CGCCGGCCCA 18c 

CCC3TCCGC3 A7GGTC3CC A^SS ^CCCGTCGA 24C 

CTACAACAAC GCCG7GGGC3 CCGGGAcS S^™" ^TCAACA CCAAACTGGG 30 C 

GACCAACAAC CACGTGATCG C3^C3CC^ GTGTCG7GCT 360 

CCAAACCTAC GGCG7-GA~ Srrr^ CuACATCAAT GCGTTCAGCG TCGGCTCCGG 4 , 0 

GCTGCGCGGT GCCGGtScC J™* 3 ^ CAGGATGTCG CGGTGCTGCA 480 

GCCCGTCGTC GCGATGGGCA ACAGCGG-GG Srrrr^I SGCSGCGTCG CGGTTGGTGA 540 

CAGGGTGGTC GCGCTCGGT AAACCGtcS ^ ACGCCCCGTG CGGTGCCTGG 600 

GACATTGAAC GGGTTGATCG £££££ £££££ GTGCCGAAGA 660 

CGTCGTCAAC GGCCTAGGAC AGGTGCTC"G T^I, cccggtgatt CGGGCGGGCC 72 0 

GCTGTCCCAG GGTGGGCAGG ATAACTTCCA 780 

CCAAATCGGA TCGCGTGGGG GOT^ccSc CGA ^GCGGG 3 <0 

CT7GGGTG77 GTCGACAACA ACGGCAAr- XX™ ^CCTACCG CCTTCCTCGG 900 

TCCGGCGGCA AGTCTCGgS cS™ CAACGCGTGG TCGGAAGCGC 960 

3ATCAAC7C3 3CCACC3CGA 7GGCGGACGC G^SL ^S^" ACSGCGC ^ G « ? 0 

G7GG3TGAAC 7GGCAAACCA AGTCGGGC3 GATCATC GCG GTGACGTCA7 1Qao 
GGGACC-r:, -C7GA--; C"" ""^ ^ "^GTGA CATTGGCCGA 

^CCGTGAT TGCCGCG7GA GC™^ C ™ ^ GGA77GGCGC ^00 

GCAATGAACG AGGCAGAACA CAGCG^- A ' ~r~~ " GTGCGCGTGG GA77GTGGAA 12 s 0 

GGC3GTGTGG TCGAGCATCC GGATGC-Aa" -SZ-Z" TGCA ^CAG TTACGTCGAA 13 20 

GATCCGACCT GCrTTAAGCA CGKGT^ GCGCGGC — CCTGCCCGCC 13 80 

GCCAGCGC3G ACGGTTCCGW CGATCTGCG TGGTCC3GGC GTTCTTCGAC 1Ai0 

T3GC77GGCA 7CGAC7GCA7 C~GT~r~~: ^r^^ ATCG KTC5A CTACCTGCAG 15 00 



GGTTAC3ACA T7CGCGAC77 77ACAAGG~- ----."I^l! ACTCAC ~" C ~ GCGCGACGGC . 1S6C 

37C3CCG7G3 7CGACA — - --"^Zl^ ; --CAA7 TCGGCACCGT CGACGA7T7C 1S20 

AATCACACCT C3GA3TCGCA Zlt^Z^ -CA7CACC3A CG7GG7GA7G I S8 C 

7ACG37GAG7 A7TACG7G7G GAG—-"'-- *~1T^ GCGGCGACCC AGACGGACCS 17 40 

jA — - - :; «GC3AGCGCT ACACC3ACGC "GGA7CATC 




--"•■AC7GGTCA 7TCGA" ~~ t-^-„_„„._, „ 

.\"GA77C 77 *' J ^^•'C^uA 3TTNC7AC7C 

::jfcp>-.a7:on fcs .52.- := > ;c . . 3 



jHCChnce .thaka.tsr : o7: r; 



1EMG7H ... b2 oaG , : 



!3. T;?- acic 
G: 37HAI,XE::.\-E5o: ; lnq l a 



18CC 
1360 
13^2 




WO 9')/4207<> 



IT I I N')'> 0326N 



CGGTCGATCA TGGCGGCAGC AACGACGTC3 TC-C— r-a 

AAGGC CTTAT TGGACGTGAC GATCAAG CTG GCCCG^ AC— *00 

AAGAAGAGGT TGGCGGCCTC GGGCTCAAAC GGAa"~GT^C r~l^ CACCAGCTGG 960 

AGC3GATACC GGCCAAACC3 GG7GAG7-CG G^2tc~ MCCAC ^ 1020 

3CGAACCGTG CTAC~A— CSGC— ^ T^GATGv. CCCCGGCGTG GTGAGCCTC3 : 080 

GCGCG7ATCG CCAGGCC3AC fo^ol GT^S »° 

CACGACGTTA TCGCGGGCGG TGATGAAATC S^tSc SS^"* "00 

TTTGAGGCCA CGAGCATGCT CAAAGTCGAa" SSSS^ AGATGTGCGA TGGTGTC3CG 1260 

GGCGGCGCGG ATCCPGCrCT CATCaS^ Z^Z, GACTTCCGA A CCGGGAAGCG i320 

GCAGCCGGCC AGGTATTC- CgJS^ 1 

GGACACTGAC TCACGCAGGG TGGGAGcS C ^CAGCCG 1440 



!=) INFORMATION ?OR SEQ ID NO : I 9 

v'-) SEQUENCE CHARACTERISTICS 

■■A) 
3! 



A) LENGTH: 3^5 base paira 



1482 



WO <W420"6 



l'( T I (U20N 



82 



ScScS SgSS StSc" CZAACC ™^ cgagctacag 

3G.GGCCGCC TCTAGATCCC S^S^I 

TCCAGCCAGG CCTTGGTGCG GCOSTO-wr TCCCGTTCGC CCGCCCGTCT 

CGGNAAAAGT CGATGTC^ ££££2 TCCAGG <^ CCCGACCTCC 

GCTGCCGAGC GGTCAACGAG JS^^ 0 AGTACA ^GC CCGGCCCTCA 

GCGGTTGGCC CGACCGCOT ^SS^ ££7™*° CAGGO ««A GGGTCCCACG 780 
AACAACGTCG GCAGGAgSg £22£S ATCGGGGGGT CTTOGCGAGC 84a 

CATCAACACC GCACGGGATC StScgS GACCGGGGGG GCGAAAACGA " 0 

r™ ™ ssss ssss tsss sssz - 



INFORMATION FCR SEQ ID NO: 21: 

( i ) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 321 base pax-s 

'CI o.rtA^^DNESS : single 
:D) TOPOLCGV: l^ear 



SEQOEN 



DESCRIPTION: SEQ ID NC 



INFORMATION :OR SE? ID NO:2l: 
SEQUENCE CHARACTERISTICS: 



480 

540 
600 
o60 
20 



1021 



22 SSI: CGCAGGTCTG 60 

"GGTCGTAT ACCAGATGCA GC~GG^— ^^^^ rTTCSATC <- GGCTGGCGGN 120 

TCGGCCCCTG ANGTCCCGAC CGC~GC-~ G ^Z^f ^ CTGC CGTT GGACCCGGNA lao 

-CCAAC3T37 -GTTTGNGAA CAAGGGNA3~ Ifl^^ ^CGNCGAT 240 

■GGI.'GIJGNA T 3 GNCGANCACA A " "^^^ o^o.-ATCGG NGGNANCGAG 300 



-1 1 



WO 42(P6 



l'( T l : S<)9 /0326S 



S3 



AAATCCGCG 


TTCATTCGGT 


GCGG7TCGCG 


GGGCTCATGG 


CTAGCGTCCA 


G7TGCTTGCC 


— ' C C G C A C G C T 


CATGCTGGCG 


GC3C3CAGTC 


CGCAGCCCAA 






^^GACAGCGA 


CGGCCCCGAC 




- .-v- ^-^GTGCT 




^- _ ^CAT GAG 


CGACCCCCGG 


^ o~GGG GAG AT 


/-^AAA T A G G C G 


AC GGTTTTGG 




- - - w^. AG GAT 



^ cccrA ^ -gccatgag ga^gcac; 

r^GC— -3 A ^ C ; GG ACCCATCCTC GAAGGTCCCO 
^™ --CCGGCTGC CGGTGGCGGC GCATAGCGC~ 



::jf^r!-^t:g- for -d vc.--r 

- SEQUENCE CHARACTERISTICS: 

^ LENGTH: sao case pairs 
■3' TYPE: rmcloic acid 
— - 3 TRANDEDNES S : 3 mci 0 



50 
120 
130 



(xi j SEQUENCE DESCRIPTION: 3EQ m NO:23: 
GTGACGCGGT GATGGGA 7TC CTGGGCCCGG CCOG-CCQ— — 

L G 1 T ^ ^ CK «* ^ TTGC^Sc ^CC~ C^SS^- 
^..-A^vv. CTGGTACGGG TTGGCCGATT TAGCCGAGA" rjur"-- 
TGATCCATGC C3GTACC3GC GGTGTGGGCA -<3Gc5^G~ G^~™ C 

GC37GGAGGT TTTCGTCACC G C-jvr-— ™, GuA^TGCCT CGCCAGTGGG 240 

— -™ ££££ S— ;H 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 725 base oairs 
<B > -^E: aucleic acid 

STRAND EDNESS : single 
''-i TOPOLOGY: linear 

" : ^° T -- T -~ ~3CSI-r:ON: SEQ :d NO. 24. 

^-CGATA ATCGACGAAG TGATGAAGGG 



IBO 
240 
300 
360 
420 
4BQ 
54 0 



- ATG - atcgcgccgt CCGATGACAG so-- 

JSACA— GA- ^~ CT T GAAGGGAACC TGTCATGAAC S5C 
-omwv.^^ ^u^.GTTA CCCSCACACC GGTTGGCGGG — - 




WO 99/420~(> 



84 



iii SEQUENCE CHARACTERISTICS: 

(A) LENGTH. 150 base pairs 
(BJ TYPE; nucleic acid 

(C) STRAND ED NESS ; single 

(D) TOPOLOGY; linear 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO:26: 

SSS^r~ SSSf" TGGCGGGGCC <^CGGTCG GCGGCAACGG CGGGGCCGGC 
oGTACCGCC. GGTTGTTCGG TGTCGGCGGG GCCGGTGGGG CCGGAGGCAA CGGCATCGC- 
GGTGTCACGG GTACGTCGGC CAGCACACCG GGTGGATCCG CGGCATCGC. 120 

160 

(2) INFORMATION FOR 3EQ ID NO ; 2 7 : 

(l) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : - 2 oase nairs 
'Rl TYPE ; nucleic acia 
'"C: 5TRANDEDNES3 : single 
(D) TOPOLOGY, linear 



(xi , 



SEQUENCE DESCRIPTION: SEQ ID NO ; 2 7 ; 



GACACCSATA CGATGGTGAT GTACGCCAAC GTTGTCGACA CGCTCGAGGC GTTCACGATC 



^FORMATION FOR JEQ ID :ic 



60 



D U 



r^^^i:: gacgatgggc gatgcggcco cgttcgcgga gg 

^' C3AC ~ "-"gggta attcataccg gaatggacoo ogtcgtcgc- ibo 

^A^CGC^A^C AGTGGGACGA CGGCAACiA. ' - — — i-- xBO 
GCCTACGACC GCAACGTACA GAG™: " ^ ^ lGuCSKKSG TGTCOTTGTC 




- INFORMATION FOR 5 EC. 10 NO ; 



WO 99/42076 



PCT/'l , S99/03268 



85 



— c - ™^ --CTC CGAAlxCgGGT 

3A3GTTGAGA TTGCC~GC — — — AC^GCTG CGGCTCGTCT ACGGCGGGC\ 
GAGACTCGGC GGTTAGG CAT TGAC^G^ — GCATCCTCAT 

CCGTTGTCAG ACCACGTG~T CGAAC— TGCCCGAC — TTTGGACGCT 
hCGTTTGG JrtrtC " T * 1 -ACCGGTGAA GCGCTACCTC ATCGACACCC 

:2 INFORMATION FOR SEQ ID NO : 3 1 ■ -'A 

£i) SEQUENCE CHARACTiRlsrrcs : • 
(A; LENGTH; 267 base oa- s 

TYPE: nuclei: acid 
■ CI 3TRANEEDNES3 ; single 
(2. TO pc LOG Y ; h n ea r 



xi: 3EQLr.NCE DESCRIPTION; 



^ NO: 31 : 



190 
132 



GATCGTGGAG CTGTCGATGA Arnr— *v-~ ~ 

^ 1 ^ ALAG^^TTGC CGGACGrrrr r=r^^.^r^» 
GCAGCGCCGG ACrACC~CGC -CG-Gr— — A^CGCG GCGGCCAGCA CGTCGGTGTA 

^ ^ AC^TGCGCG GCGGCGCCGG ACGCCGCCGT xao 

12) INFORMATION" FOR SEQ ID NO: 30: 

(a) SEQUENCE CHARACTERISTICS - 

fA) LENGTH: 303 base vzirs 
fBJ TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(XI) SEQUENCE DESCRIPTION: SEQ ID NO;30: 
5St?5c G™.^^? -f?!^ 3 * SAAAGTC TGGGrGCC 



50 

I3C 
240 
30C 
308 



- ^- GACG AC 3 A 3CAACTCAC': - - ™^ „ „ 

33 3 3GGAAGC TGCC3~GCGG 3AACT^~— ^"-^C^^ -ATTGAGGAC GGAGAGAATC ^ 

3GCACCGGAC TATTCTGGTG TGC"GC~~'-'~ :i ' G ^ AGAG SC CTCCCGCGAT ::o 

ACACGATGAG GAATCACACC TACCGAG^A ^ GiAAAA - A ATCTGAGGGG : 8 o 

TCGACGCGGC AATCCAGGGC 3GTCTGG " ~ " JrtUrtitu ' -^^GACCTCG GCCGACGGCG : 4 q 

26" 

2 I ' IFO RMA T I ON fqt? 3EQ ;o *J C — 



WO 99/42076 



PCT/USW03268 



86 



^^ GrtC ^ u TGTGGGCTGC AGCCGGACGA AGCGGTGCTC GACGTCGGC^ "CCTrTr— 

-AT^CGCAG AAArr-U~ ^ GAACAGCGAG GGACGCTACG CCGGCTTCGA 
-ai^Lo^G AAAGCCATCG CGTGGTGCCA rcarr-^*^ 



: 33 : 



GACACC 3CCGCGGC 



^A'O A j. G * 



GGCCCAGCCC ACGCAGGGCA ICACGC; 
CTC3CCCCAT CGGTCGCGA — *r-~-,. 
\CCAACTCG GGTGTGTGGA TGAC""* * 



- ^ J ( uuvjI 



-TACG 3TATC """^r. — 



:"":rmatio:; fcp :; 

- SEQUENCE CHAP-ATEr 



540 

600 



a^cS? c^c^ ;:;'1^ c 1 c £*™**c cc****** 660 

G7TCACCCAC SSrcctc C^™ A JST"*"" GTGGTCT ^ C ™CCTCGGT 720 

^^<- C^rGAC^TGGA GCACTATCTG GACGAGATCT CCCCC^Tr-T 

?"cSS ^f"^ Crr™ CT C AATGACGAGT SS^SS 



840 

900 
960 
1020 
1080 



nr ._ _ fc ^^^t\x\jM.\j b^rtLLbtiCjiT ATCGGAPAAT 

TGGCAAPTT^ CTTGCCGGAG ACCTTCGTCA OmSSSSI 

ess ™ ~i EE iii EP 

CAGATTAGCC CGCCGCGGC- -~r—r—„ ^ C ° CTGC CGGCAGGCCG ATTAGGCGGG 1140 

GGTAACCACG CTtcSS£ ^-X^ ^TACGGCGC CCCGAATGGC GTCACCGGCT X200 

AGCCTGCGTG A-CGCTtt- Ir^^T ^ CAGGTGGTAG ATGCCGACAA I2S0 

CCACCCcS ^C^: ^ GCAGCCG ^GCACC AGCGCGAACG X320 

~CGGCATCAC GlS^A^ SSf^ TGACCAAACC 1380 

-aga— .™::r;: rrr^ 0 - 1 3ATAgccaag atgaaa A — mgggcac^ 1440 

3CGATAAUAA CCC7A™ 3CTTK3GGT GAGGCCCGTG ACCAGCTCCC 1SC 0 

,2: INFORMATION FOR SEC ID Nr ■ 3 t ■ 

_ . . : SEQUENCE CHARACTERISTICS: 
• LENGTH: S51 base oa-i 

(3) nucleic acid 

>' C ; STRANG EDNEG 5 . single 
:i>) TOPOLOGY: linear 

:xi: SEQUENCE EES CP I PTICN 5EQ — 

CTGCAGGGTG GCGTGGATGA GCGTCACCSC 
;^^'^' rrGCT -CGGCGGCCT ACGACACGGC 
ICCCGAGAAC CGTGCTGAAC TGATGATTCT 
— -GGCGATC GOGG^'^" ^ « " - — „ ^ „ 

3ATGTTTGGC TACGCCGCGG CCACGGCG/ ^ 
3GCGCCGGAG ATGACCAGCG CGGGTGGGCT 



GG G G C AGG C C 


GAGCTGACCG 


CCGCGCAGGT 






ACGGTGCCCC 


^ov_ ^. GGTGAT 




■j^v - ^a^jCGACC 


•-^A c ct cttg g 


GGCAAAACAC 


13 0 


- -j o ^ jmG ATG 


- j'j'jCCIAAG 


j-\CGCCGCCGC 


24 ! ; 


'jGCGACGGCG 


.^cgttgctgc 


CGTTCG AGGA 


3 0C: 


- ■ — Go AG GAG 


1 v - ^ - j ^ G C GG 


TCGAGGAGGC 


3 6 /■ 


--iv-vA C- iA T G T /I 


GCCCAGGCGC 


TGAAACAGTT 


4 2 0 


IT C C.AAG C T G 


GGTGGCCTGT 


GGAAGACGGT 




3GTGTC3ATG 


GCCAACAACC 


ACATGTGGAT 


4 r 












■■■> - 'TTGAAGG 


- ^ - G CTG7 








- GAT GAG CT " 




'GGGGGTGG ?j 


: i . ivi . . , .. CA 


'i.'^- * - GGGT C" 



^8C 
:3 4 ■„ 



W O W42()7b 



PC I I S99 03268 



GATCGATCGG GCGGAAATTT GGACCAGATT CGC— r — ^. ~ 

™™ — — ££££ ££££ 

CCATC^C GTTCGAGGGC £££^ — ^ ^CGACACOC 

GCTTGGTCAA GATC * ^TGAGCGAG GCGACGCAGT CGCAGGCTGC 

(2) INFORMATION" FOR SEQ ID NO: 35: 



fi) 



SEQUENT CHARACTERISTICS; 
(A) LENGTH: 1227 base Dairs 
£B) TYPE: nucleic acid* 

(C) 3TRANDEDNESS : single 

(D) TCPCLOGY: linear 

SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
— GAACCGGCC 



IC 2GGGGCGAAC GCCGGTTCGG 



6 0 
120 
13C 
240 
254 



C3ATC7GGCG GTGCGGAT^ ~GG— ^I-^ TTGG A C C A G G AGGGAC2GGA 
_ — —G., oo^GGTGC GCTGGATTGC GCTATAA™ 
^CT.v_oAC ^_^ACGC TGGATGGTGA C CAAAC ^ G C G — ~ 

SIS c- G ™::: CGCCGTAT " T **™'<™ 

^:^r ;S ACAA7 c ^ CC a ccggctcctg 

CCAA3ACC7G A™~ SSS^ ACC — C ~ T ^OCjUCAgS TACGAGCACA £ 
CGCCGCCGGC GGcIo^C CCGGGTGGCS 

AGTACACGAT TTTGTCGATC GAGGTCAC™ ~j^~f ^T^™* TATTGCSAC - ^80 

GGCTGC7CAG C~.GGC.AAG CCCTGATCG3 AG^TC G^™ —^f^ 5 - 

rac^c g~a^-:^: f — - tsccstw*, gctccgttgg 

TGCwGATCAG -v=~™~ ---^l^r - GCC3ACAC =ASCSC3AGG GCTACCACAA 

w s_j -^CGTCGCT 

GA7A7GCGGC GGGCGGCAGC 3CCGCGTCGT : 



ISO 
24 :■ 
30: 

42 0 



o 0 0 
6 6 j 



7T2T2C7AGG 3TGGTGGACG :~a™~~ Zl ZllZ^" — GGCATTGI 



3 4 [J 

90 is 



3CAA7CAGG~ Sgc^^ -CGGCGGGT 2CCG7CAGCC 

:tcaacgcga cggggcggg^ igcggcgc^ Itaa^^ga ?^:^ GCT 

»CGAAGGACG jAGATTTTGT 2ACGATC W "" rtV ^ ^ACCTTAGGA 1200 



-3 0 
.4 f! 



information f~p 



WHENCE 2. £AR A GTE R 1 ..." '2 1 .\; 

A -H:;gti: : : « ; n ac . , 
- 3.raneeon-E33. ^ — 



APOLOGY 



WO W420-6 



I'd/UN 1 ) 1 ) ()32(>N 



88 



(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 290 bas/pairs 
(3) TYPE: nucleic add 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 

sss ssss s^GcS cggtgtcggc -™ 

OeSGCGCCTC CACCCGcS J^SS 

CCCCGGACGG CGCCTTCGGT GGOACGgS SSgSgG ^Sf' CGM6aVACB 
3CACTCAGA3 CGCCACCGGC CTCG,^ SSS ^ 

2) INFORMATI ON FOR SEQ ID MO: 33; 

! -> SEQUENCE CHARACTERISTICS: 
(A) LENGTH : .34 sase na- s 
<B) TYPE: nuclei, acid 

(C) STRANDEIXJESS: single 

(D) TOPOLOGY: linear 



x i , 



SEQUENCE DESCRIPTION: SEQ lD ' :0:2i 



SACCCAGTGG CATGGNGGGT CTCACTGGAA GCAT 
- I>fr ORMATION FOR SE V ID NO : 3 9 : 

SEQUENCE CHARACTERISTICS - 
<A) LENGTH : :55 ^se^aius 

(3) TYPE: nucleic acid 
■:C) STRATA EDNES S : cinql- 
■ D ] TCPCLOGV linear 

X1 SEQDENC2 --C-PTIC,,: 3 EC ID NO:39: 

IGGCCTGCTC 3CCAGCACCC — ~« I^IIT^ --^C^GTTA CCCAACAAGC 

''-FORMATION ECE .?EC . ; , . . 

SEQUENCE i:i\RA ^TE" - 



60 
120 
180 



290 
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(ij SEQUENCE CHARACTERISTICS . 

(AJ LENGTH: 132 base pairs 
(B) TYPE: nucleic acid 
(CI STRANDEDNESS : single 
(D) TOPOLOGY linear 

(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

GATCCACCGC GGGTGCAGAC GGTGCCCGCG GCGCCACCCC GACCAGCGGC GGCAACGGCG 
SSSS ^ CGGCGCG TCGTCGOTGC GCCGGCGGCA £ 



(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE 'CHARACTERISTICS: 

{A) LENGTH : 132 base pairs 
(3) TYPE: nucleic at**. 
* C ■ 3 TRAOT ED NESS : single 
\D' TCPOLOGY: linear 



:?TION: SEQ :n NO: 41 : 



C3GNACGGNC GGGGAC3GCG GCAAGGGCGG NAACGGGGGC GCCGNACCCA 
Z:/" 1 ::^ — TCCNCCAA7G GCGCGAATGG CGGACAGGGC GGGAACOCCC 



■' ~ ) INFORMATION FOR SEQ ID NO ■ 4 7 

( i ) SEQUENCE CHARACTERISTICS . 

:a; LENGTH: -C2 oace nairs 
.'3) ./?E: nucleic acid 
: C ) S TRAND EDNE S S : z l ncj 1 o 
~; TOPOLOGY: linear 

3SC-JEMC- -E3CK:?T:-:J: 3£Q ZD NC.43: 

^::lr; AC " — — ~ — GCAGCTGCC3 ATTC3CCGGG TTTCCCCACC 

^LT"^ ga rKC3c ~ c = saagtagggc gatccgttcg cgatgccggc 

, - — — G-^, .•iCv.TTTJiGT TTAGCGACGA TAATGGCTAT 

-•Av_r^^v_ . nnu j .-iG jATSATf'" ""-A^A^' ,~~„ ^^™„ Rrt „ „ 

-,-,,™™ r ,„. _ ' wv - Jrt — ■■w J .-u^a^L „ j 4 GACGGT3 GATCAGCAAG 

-a : cacI- ;z:2I:: A ;r : - gcgatggg -sac races actgatgtcc 

^ JUJ ^ ^n^n w \ ■v/vmCGc::: -caacagntg gtnttgtccg 

--^r^rti^wA* jC3GGAA*AC "~ " - - -y • „ „ _ _ 

^ ' : J ' J ^ ;UV4 ^ >j - ^vj'_AG^3T CTGGCGACCT 

-^^^owj^vm -v_ 3 _GG C CAAG SNn~A~GV" 

_ -j~>-.i •^■--..vj/w^ ^.rtGo^.j^j AC2GC3CTGG 

, —.^-~o-.^ -^^G^.Gu oGCCGTCGGA GGGGACAGTT 

^j.wj^. rt^'-w^TACS "CrjAGGG^GG — ^-.^ ______ 

_ , ^^.^ >^> — ^C.^^C TTCATGGATC 

_____ JlJ ^-AC^uk^ C^CATCCCTC GCGCACTGNG 

1 INFORMATION Fl? j"'" — — 



132 



o 0 
L. 2 0 
.Hi- 



fi n ; 
6 6 ' 



WO 99/42(P(» 



I»CT/l , S99. , 0326S 



-v 

l'jij^ w jij-jGT GGC7C7GCT 
CGACGTTTAA CAAGGCCAG 



JL Jv. ^TV J _ 

— ctvjvjACTGCA 

.-.v_ j o ^ o ,„ _ A TT " j T G G AG 



GGGG CCGCAGCAGT G7?C3CAGCG 7CCGGGATAT IGGTCGCAG" 



-o^v. waA7 GGGGCAG7GG ATACAIT 
— -jGCGCAGTCG GGGTGGGAAG AA"" , ':rn 



G7TCGGC7CC AGTGAAC7A 
GGGCGCCGG7 GTAACGGGG 



50 
120 
180 
240 
298 



(Dt TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEC ID NO:44: 

GAAGCCGCAG CGCTGTCGGG CGACGTGGCG GTCAAAGCGG CATCGC7CGG TGGCGGTGGA 
GGCGGCGGGG TGCCG7CGGC GCCGTTGGGA TCCGCGATCG GGGGCGCGGA ATCGGTGCGG 
CCCGCTGGCG CTGGTGACAT TGCCGGCTTA GGCCAGGGAA GGGCCGGCGG CGGCGCCGCG 
CTGGGCGGCG GTGGCATGGG AATGCCGATG GGTGCCGCGC ATCAGGGACA AGGGGGCGCC 
AAGTCCAAGG GTTCTCAGCA GGAAGACGAG GCGCTCTAG\ CCGAGGATCC TCGTGCCG 

(2) INFORMATION FOR SEQ ID NO: 45: 

( 1 ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1058 base pairs 

(B) TYPE: nucleic acid' 

(C) 37RANDEDNESS : single 

■ 3; TOPGLGG'/ ■ linear 

xi. SEQUENCE DESCRIPTION: SEQ ID NC:45: 

^^ MrCGC GTCGCCGG3A GCACAGCGTC GCACTGCACC AGTGGAGGAG 6 0 

rr:;::::;^ C :^:^ GT aaccgggg - acccgcaagc gcagcccgca ggctcctacs 

^'°""" X ^ C ACCCTCGTTC ZCZCAZGCCG ATGAGGGTGC GAGCAAGCTA CCGATGTAC~ 
^^l^ ^1^™^ —GGTCTGG CTGCGTACTT CGCCAGCTTC GGCCCAATGT 
3^;^ GTC GGG ^^SG7G AT'GGCGCAGT G7CCGGTGAC ACTGGGCTGC 
^CTGCGCTGC TTGCCGGGGT ggttctggtg cctaaggcca 
■j^-GGTGCTCG GGGTAC7CGG C3TAT7TCTG ATGG7CTCGG 
o^GTATTCGA CGGGTTGGGC ATTGTGGGTT G7GTTGGCTT 
.377 GCGGCAGTCC TGGCGC7CT7 GG7GGAGACC GGCGC7ATCA 
-CCC AAGTTCGACC CGTATGGACA GTACGGGCGG 7ACGGGCAGT 
r-A^. CCGGG7GGG7 AGTACGGTCA GCAGGGTGCT CAGCAGGCCG 



120 
180 
243 
3 00 
3 6 j 
420 
4S0 
540 
6C0 
6 6 0 



^rtt.^.. . ,G 3;; 

ACGCCACC7A H 4 '.■ 

^ -jvji CAG73 C C3GGAI3GGG 7CGCAGGC7G ^00 

AC CCC A GCGGGGGCGA GCAGT IGTCG TCCCCCGGGG ;^6 C 



■ JA -----»^-'- CCGGTCGCGC GTG7GCGCGA AGAGTGAAC; 
jou.ji^AGC AAGCGOGGAC GATCCTGGTG CCGAA""**" 



i/jc: 




WO 99/42076 
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i2) INFORMATION FOR SEQ ID NO:47 : 

:i) SEQUENCE CHARACTERISTICS - 

(A) LENGTH : 170 base pairs 

(B) TYPE: nucleic acid 
<CJ STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

SSS ££S 5^ ?™ « c ™ -™ 

T^TC* ^TCG^TO ATCTTGTTCT „ 0 

17 0 

C) INFORMATION 70R 5EQ ID NO: 43: 

'i; SEQUENCE CHARACTERISTICS: 

U\) LENGTH: 12 7 base pairs 
(3) TY?E: nucleic acid 
fC) STRANDEDNESS : single 
(DJ TOPOLOGY: linear 

..x:: SECUENC2 DESCRIPTION: SEQ 13 MO: 48: 



-'--JGGGGTSG ~GGCGGAA-~ — ~ ~XC^ ■'-~-"3CTGG CGCTGGCGGC AAC3GCGGGG 

^.^^ TGTTGGGCAA CGGCGGTGCC GGCGGGCACG 



IIIFORMATI GM -CR SSG- ID ;, 0 . ., 3 : 

1 SECUENC" rKARAGTERIET'^" 

2' ^ iVGTH: "air: 

3TRANTEDNESS . cinq'*- 
iD) TOPCLCGY: linear 



SEQUENCE DESCRI?'-:^;- 3"- — ja 



JvJv '^ jG"CTCAAC 



60 
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INFORMATION FOR SKQ :o NO : 5 1 

fl) SEQUENCE CHARACTER I ST T CS ■ 

(A) LENGTH: 355 base^pairs 

(B) TYPE: nucleic acid 
(CJ STRAND EDNESS : single 
(DJ TOPOLOGY: linear 



(X:i SEQUENC - DESCRIPTION: SEQ ID NO : 5 1 : 
CGGCACGAGA TCACAC^TAC P-sr--^^^ 

ACGCGGKAAT CCAGGGCGGT £££££ ««*™TC° 60 

7CGAAG7ACA G7CAATTCGA GGCGACCTgI ScSgC r-S^ '«0 

G7A7GAAAGT CGGCTTCCGC C7GGAGGAT7 ."S^- ^GGTGA 180 

r2I:™I 77CCAGAACA S^O CGATAACTGA 240 



CGTT^AucCG 300 



" "~ 'WCCCTGC3 ACAATTC3TC GGCC 
INFORMATION FOR 5EQ 10 N0:5 2: 

- * . SEQUENCE CHARACTERISTICS ■ 

(A) LENGTH: 9 59 base oa^'rs 

(3) nucleic acid 

(C) STHANDEDNESS: single . 

fD! TOPOLOGY: linear 

SEQUENCE DESC^'^'^^ ~ 

^ . ot>w ^ D wo : 5 1 : 

IATCACC ATCACCATCA CA— ™^ 

^w^u ^riCTGGCTAT ~G~ — ™ * " ^v_^^ ICavJUGCGA 



15b 



-^-^ 'waLTGGCTAT ~GT' — ™ " lw ^^ J * w^^u^o^(j 

-^CGACCG CGAAC3CCGA -"r^r ^^UA GCC7GGTGAC GG77GCGG7G 
"CC37CGA CCGC7GCAGC G^--- ^I? 3 '"^^ GGCCGCCTC3 [go 

^^TCCcI ACG^gSS SSccSc- -™ 
-AACCCGG ^GGAGgItt CAGC-™ -5™^ ^ C " C ^C TGTCC3GATC - 6 "o 
—AC77CG ACTAC3G7TC AGCAC7™ AG^^' 3CTGGGTGGA GTCTGACC 



3ACAGCC3C CCCCGGTGGC CAATGACAC'* ir^" ^CATTTCCr 

--TTTACGCCA GCGCCGAAGC CAC7GAC7~ A^G—- ! AGACCAAAAG b , 0 

"-3AGTTC7 A7ATGC CCTA C7r" - -GGG C7CGGACATG D . ;c 

"•""•■-A~GGGG — -~~ — . . ,„ " ~ -*~ A -'-<*A'"-'" -\GGAAACC37 — 

- - ^ - ^ j ' '"" , „ _ „ „ „ r „ , "~ * ~ ^ * ^- ^ 7! t> C 

- — -^w_3GAC 3GGC~TAA"*~ ZZ — ----- ' rtL3 ^0 

3GGC "CCT'J AGCOCTGGTT TGTIGTA";'-: — ^'"-"^'--^ ACCGGACGCC h - 

;'^;^^'"" rA A ^"SCTGGC CG/iAT.-GAT'J - xr '^L-, ^ ^^""X^^^^ JGT:]GACA AG 

-kj 1 . -TGCAGAGC? 7GC"""~'j.- — - ~ " " ' J - - - j jCGCCS 9q<j 

scgac;acac cgacaccgca gcgga-™"" ^™^I:i;^ 7 " 3GGGAagt -GCTCCTACC 9^0 



480 



Nr".p:<ATi::N for ^fq io v 



q Q q 



WO 99/42076 



PC I/LS99 03208 



Met His Has H^s H^s u 

J- «... Hla Me L Hi , ,1a Vai Asp Pro Asn Leu ^ 

Arg Arg Lys Giy Arg Leu Ala A l , - 

-Ser Leu Val Th rVal P-AU^^.^ 

sT ^ 5 T - - - - - - - Ser Thr 

Ala Ala Ala Pro Pro Ala Pro Al* » 6 ° 

65 70 Pr ° Val AIa Pro Pro Pro Pro 

Ala Ala Ala Asn n>- , 75 an 

£ Pro Asn Ala oln Pro Gl y Asp Pro As, Ala £ 

Pro Pro Pro Ala Asp Pro Asn a', a 1° 95 

" ^ f~° Pr ° P « P - Val He Ala Pro 
Asn Ala Pro Gin Pro Va i Ar~ - , T , 1" 

115 " " e Pro Val Gly GU- =k„ «..,_ 

?he Ala Leu Pro Ala 31- " 



-7 5 ''* ai ~ a — H« ?hfs ASD 

Tyr G-v Ser A 1 a r 140 

i , - ' " U Leu s - r -vo "-h- »w r-L, , 

^ 152 • G1 / Asp P ro Pro ph „ ?rn 

Giv Glr. P-i p--, r>,„ ,, , 15i 

-- ~'o Val Ala A=;- ~ u laa 

'VP lhr Arg lie Va' t„,. . 
iD = » , 7n - e - Ga / Arg 

-eu .Asp Qla Lys Lou Tv- a" i * i?5 ' 

■ *** 1^ Glu ^ a Asp Ser Lys Ala 

Aid Ala Arg Leu Glv ^ , _ _ _ 130 

195 ' * " '^ e ~ GIu Phe Me: Pre Tv>- p>- 

Gly Thr Ar 5 Tie As- ^, 2 °5 

"10 * ^ 3er Leu Ala Asn Gly Va * 

Ser Gly Ser Ala Ser Tv r ^ 220 

^ n giv .I, :;: _ . 23 > J40 

^ A± G1 -' Pre: Ala Ala Aan 



230 " ^ Ser p ^ Lys 

240 

" J : ' djL CJiv Ser Pre; All 

Al a 



^ Asp Al, , 250 



:6C " f " 3 * :: ? r7 ?he Vai Vai Trp Leu Giv 

— ^ .-v^a Asn p ro - . > „^ . 2 7q 

-"5 * " /S " 7 "' y Ala rvla -VS Ala Leu Ala "in 

■^er :i_ Arg a— : „.. , " 285 

= 9C ? ro Pre Ala ?ro A ; a Pr . 



IP-MAT 1 pep 



A; lENGTli'. 



: a: 
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5 

Val Ala Ala Leu 
20 



10 



121 INFORMATION FOR SEQ ID NO: 55; 

<i) SEQUENCE CHARACTERISTICS ■ 

(A) LENGTH: is ammo acids 

(B) TYPE: amino acid 
(CJ STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi J SEQUENCE DESCRIPTION: SEQ ID NO: 55; 
Ala Val Glu Ser Gly Me- Lpv :i a T 

. ^/ Me. Leu Ala Leu Gly Thr Pro Ala Pro Ser 

10 , ? 

:.' INFORMATION FOR SEQ ID NO : 56 ; 

< i ; SEQUENCE CHARACTERISTICS - 
(A) LENGTH : 19 amine a— 
O) TYPE: amino acid 
<C) STRANDEDNESS: 
(3) TOPOLOGY: linear 

. xi, SEQUENCE DESC^"^"^* ■ ~— — , v _ 

' ir ''« — ' ^0:56: 

Ala Ala Met: Lvs ^ro A.-- ~i 

1 " T ^ " Y ASP GIv ?ro Gin Ala Ala Lvs 

-j-- ■oj.y Arg lb 

INFORMATION FOR — — 

- SEQUENCE CHARACTER I STIC3 

:A; LENGTH; ii ammo acids 
iB! TYPE: ammo acid 
(C! STRANDEDNESS : 

TOPOLOGY- linear 



N'FORMATIDN "OR 



MU : 



S EQUENCE ~ IARAC~E R T ~ ~ - ~ c 



WO 99/42076 



PC I l'S<>9 032(>S 



SO 



Asp He Gly Ser Glu Ser Th- G^r ^ ~i ^ 
1 r ^ v^ln Gin xaa Ala Val 

10 

) INFORMATION FOR 3EQ ID NO:59; 

£i) SEQUENCE CHARACTERISTICS - 

(A) LENGTH: i 3 ammo acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(XI) SEQUENCE DESCRIPTION: SEQ lD NO :59; 
Ala Glu Glu Ser Il e Ser -^h- v. 

, ^ oer .hr Xaa Glu Xaa lie Val Pro 

5 10 
INFORMATION FCP W Ir; ;;o - 

SEQUENCE CHARACTERISTICS ■ 

! A ! l ENGTH! 17 atnino acids 

^3 ) TYPE : ammo acid 

(C) STRANDEDNESS - 

>'D) TOPOLOGY: linear 

As? Pro Glu ?r= Al, Pro Pro Va : „„, . , i 

- ~ * ~ Axa A* a 'Via Ala Pro p-n 

10 



information for seq :o :*c :b : 

^ EC T JENCE IHARAC"^ ' ~~ - ^ , 



^ ; -Yr-E: ammo acid 
~: ^RANCEONESS : 
■~) TOPOLOGY; Imear 



3ECtJENCE OESCRIPTIOV- 



- 3EQUENCE I KARA CTE R 1 3 T I C S 
■Ai LENGTH: Jc ammo 



WO 99 420~f> 



PC I I S99 (J326N 
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-eu Leu Asn Asn Leu Ala Asp Pro Asp val Ser Phe Ala Asp 

30 



20 25 



12) INFORMATION FOR SEQ ID NO: 53: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 187 ammo acids 
(3) TYPE : amino acid 

(C) STRANDEPNEPS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:63: 

Thr Gly Ser Leu Asn Gin Thr His Asa Arg Arg Ala Asn Glu a« .Lv: 



5 

As 



10 15 



'hr Thr Met Lys Met Val Lys Ser lie Ala Ala Gly Leu v. 

* - 2 ? 25 "30 

' ^ d — — a —a ,,-y : a . Thr s-r ll„ .Me- .Ma 

3S !0 45 



Gly Gly Pro Val Val Tyr Gin Met Gin Pro Val v.: Ph „ GIv A - a o~, 

3 55 6C 

Leu Pro Leu Asp Pro Ala Ser Ala Pro Asp Val Pro Thr Ala Ala Gin 



70 



80 



^ U ' hr LeU Leu A£;n ^ er Ala Asp Pre Asn Val Ser A : a 

85 90 95 

nGn uys .j.y Ser Leu Val Glu Glv Gl- '-^ - ^ r-i 

^i. .r. ui/ Jiv Thr Glu A^a Arg 

l-e Ala Asp His Lys Leu Lys Lys Ala Aia Glu His Glv III Leu Pr:, 

:lc 125 
-eu Ser Phe Ser Val TV" As- - , . . , 

^ e ' J - n Pr - — * A. a Glv Ser Ala 

135 -, l3 

^ a A2p Val Jer Val -V Lys l*u Ser Ser Prn Val — 

3ln Asn Va. "a' — - ~" '° 0 

^ * j -y — P Met .eu Ser Ar- Ala 

■ — , 



ia Met Glu Leu Leu Gin Al 



180 ;gs 



a Ala 31-.' Xaa 



INFORMATION FOR SEQ 



A LENGTH- MS ar.ir.c ici. 
^ "TY?" : amino icid 

r TOPOLOGY linear 
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50 55 60 

Ala Gly Arg His Pro Asp S er Asp He Phe Leu Asp Asn Val Thr Val 
65 70 75 

Ser Arg Arg His Ala Glu Phe Arg Leu Glu Asn Asn Glu Phe Asn Val 

85 90 95 

Val Asp Val Gly Ser Leu Asn Gly Thr Tyr Val Asn Arg Glu Pro Val 

100 110 
Asp Ser Ala Val Leu Ala Asn Gly Asp Glu Val Gin lie Gly Lys Leu 

115 120 12S 

Arg Leu Val Phe Leu Thr Gly Pro Lys Gin Gly Glu Asp Asp Gly Ser 

u "° 1^5 140 

Thr Gly Gly Pro 

145 

I INFORMATION FOR SEQ ID NO-.6S: 

SEQUENCE IHAi^JTKRISTICS : 
(A) LENGTH ; 22 0 ammo ac:-s 
:3} TYPE, ammo acid 
(CJ 3TPALTEENESS : single 
(D) TO TO LOGY : linear 

;xi; DESCRIPTION: SEC ID NO: 65: 

Thr Ser Asn Arg Pro Ala Ar^ A 



Pro Asp Arg 3er Ala 



~ly Arg Arg Ala Pre .Arg Asd Thr 

10 ~ ls 

Ser Leu Val Ara His Arg Arg Gin 

- a 25 30 

^" - ^ a ~« u CV3 ^- Ser Thr Gin He Ser Arg Gin Ser 

1"""" * *" ''" a ^ v ^s-i Tyr Ser Arg Arg Asn 

^ ° 55 5 0 

------- :/he ' et: -eu 7a ^ Thr A^a Val Val Leu 

r u ..... „ J . "s 

" e J ^-' *'^ a Thr Ala Ala Pro Lys Thr Tvr Cys Glu 

^ H 95 

Thr Asp Thr Gly Gin Ala Gys Gin lie Gin Mer Se- 

, ids no 

Asn He ;;er Leu Pro Ser Tvr Tvr ?r : Asp 

' " vr Sir. Thr Arg Asp Lys ?he :.~>v 

'"" r ? ^1 U A— a Tyr Glu Leu Asr: 

k - • — Tyr 7 In Ser A* a lie Pre Pro Arzr Glv Thi G1-- 

I]] C 175 



o-.u Leu 



WO 99/4;(P6 
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(2) INFORMATION FOR SEQ ID NO: 66: 

i'i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 13 2 ammo acids 

(B) TYPE : ammo acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: lmear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:66: 

Thx Ala Ala Ser Asp Asn Phe Gln Leu GJ , n cly ^ ^ ^ ^ 

Ala lie Pro lie Gly Gin Ala Met Ala lie Ala Gl y Gin Iie L! g 3er 
GlY S " ^ ^ ^ ^ Gly Pro Thr Ala Jhe Leu Gly 

f 0 Y V ^ ^ ^ Sly Ala Arg £ ain ^ 7al 

Val Civ Ser Ala Pro .la aL Ser Leu 31y 11. Z Thr Gly ^ Va , 



r,r Aia v a : Asp o: y AIa Pro Ufi Asa s ? f r Ala Thr Ma ^ £ 

Asp Ala Leu Asn Gly Eis His P>-o a™ v i n 

100 ■ " -° ^ As P 7al Ile Val Asn Trp 

3ln Thr Ly S Ser Glv - ~ l °* 110 



TU >- 



115 " 7hr G1 >' Val Thr Leu .Ala Gl^ 

-O' Pro Pro Ala ^ 5 

13 0 

INFORMATION FOP SEQ - D MO : o " : 

■ i • SEQUENCE CHARACTERISTICS ; 

■ A; LENGTH : IOC ammo "acids 
; 3> TYPE : ammo acid 
■C; STRANDEDNES3 : smgle 
>:OJ TOPOLOGY: lmear 

.xi, SEQUENCE DESCRIPTION; 3 HQ - c . — . 

— - Leu Ar7 ser Pro Me" - ~>-~ 



3 5 ^ /wa ^ 

:r j,l - »r - - - ^ - , 

^r Ala Sly MeL Ala Ar^ v..n Ar _. - ^ 



L v :r Xaa 

6 C 

Xaa Val NV- Pre Xaa Val 
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fiJ SEQUENCE CHARACTERISTICS: 

(A] LENGTH: 16 3 amino acids 
(3) TYPE : ammo acid 

(C) STRANDEDNESS ; single 

(D) TOPOLOGY: linear 



(XI) SEQUENCE DESCRIPTION : SEQ ID NO:68: 

Met Thx Asp Asp lie Leu Leu He Asp Thr Asp Glu Arg Val Arg T hr 
Leu Thx Leu Asn Arg Pro Gin Ser Arg ten Ala Leu Ser Ala JL Leu 
Arg Asp Arg Phe ?he ^ a Xaa , eu ^ ^ ^ ^ ^ ^ ^ ^ 



He Asp Val Val 

5 5 



40 45 



30 eU G1 Y Ala Asp Pro Vai Phe Cys Ala GIy 

vn :ai Ala Gly Arg Ala Asp Arg Ala Ala Glv H— v-< 



-eu Aso 

" "° 75 g 

^ HlS AS? 3lR «y Asp Arg Arg Asp Gla ^ q 

^ 3 ^ A1 * ™ ^ Arg «L Pro 

Aso Arg Leu Arcr i 1 ^ ^ . , LXU 

- * A., A^ ?ro . eu Arg Arg His Pro ^g ?rc Glv Glv 

- ^ "° 12U 1""5 

^a A^a .Ala H13 L^u G 1 - * „ ^ Tr , , 

130 ~ Aia Ala Lys GIy Arg 

Hi ' 
14 : 



40 



Arg Xaa Glv Pro Val Asp Glu Pro Asp Arg Lg Leu Pro Val a~t 

^ c 155 ~ 

rtsp Ar^r Ara 



INFORMATION FOR SEQ - NO : * 9 : 

rt; -.i^NGTH ; 3 4-t ammo aci 

3) TYPE: amine acid 

C: STRANEEDNESS : single 

:E; TOPOLOGY: linear 



:jo • 




W O <)Q 420 n 6 
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017 a.p p„ ^ n Ala Pro ^ ^ AIa Trp Aia ^ wy ^ ^ ^ 

£ Gi> ' P " AI ' 3 ^ ^ -P Val S ;Ua Giu ^ 

J-u Gly Thr Ala val Oln P he His Phe Ile Ma £ Leu ^ ^ ^ 

Leu Leu Asp Glu ^ P he Leu Pro Gly Gly £ Arg ^ ^ ^ ™ 
Mec Arg ^ ^ Gly Gly Leu -° ^ v ^ ^ 175 wa 

Hi- Arg Pro Giy ^ aer Thr ^ £ ^ »» ^ ^ 

-P j.p Leu Ala Trp Ma £ ser ^ - ^ ^ ^ 

A^a A_j.a Leu Ser Hi c- n; P - „ 

22 5 ijeU ^ P Thr Wa p ro His Leu Pro Pro Pro 

Thr Arg 3i „ Val Vai Arg Arg Val Val Siv "r -m w , . =" 



4 



250 



Met Pro Met Ser Se- A-a . Z 255 

** p 3iu : — 5 "hr Ala Giu Leu ? — 

16;" 



- b J 



Ala Asp Leu His Ala T u.. . r _ T ** ^ , , 270 

275 Leu Lcu Leu Thr Gly Leu Aia 

d i ^* ® ^ 2 3 ^ 

,ro His G in /a, Thr Asp Asp A^ Val Ala Ala Ala Arg Ser Leu Leu 

Asc Thr Asp Ala Ala Le« G1 .. v . 3 °° 

3 05 *~ - A ~ a Leu ^ a Ala Ala Phe Thr 

Ala Ma Arg .Arg :i e Glv ^ ^ ^ b 

3 ^ q " " lJ ^> r Aia A^a Ala Glu Gly Gin 

Val Ser Arg Gla A sr - 33 ° 335 



INFORMATION FOR 3EQ L^ \ T G - ■ 

~ GEGUENCE 2ILARAOTr:p:;jT:i*3 

'.A; LENGTH. ^85 im ir.o ac:a: 
'■21 TYPE: amir.o icid 

^) TOPOLOGY, linear 
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Asd Gin T*~- r; 1 t . 

• „ «* tor «r »r *, „„ „ Phe „. 

- j- ... to ^ „ ,: e ^ Pro _ ser - ^ Lm ^ 

- s " •* - it: - - - «r £ w Pro 

He Glu Asp Ser Leu Gin Ser He Phe A! if* 16 ° 
ig5 Phe Ala Tor Leu Gly Gln AJa ^ 

21u Leu Gin Arg Ala Gly G l y G1 „ Xhr ' ° 175 

180 y 7 ™ ^ ^ Phe ser Hls Leu 

Arg Pro Ala Glv Asp Ara v^i », <- 19 ° 

195 ' P " a Ser Gly Gly Thr Ala Ser Gly 

?ro Val ser Phe Leu Arg Leu Tvr ^ Se _ „ fll 20S 

210 Se - Ala Ala Gly Val Val Se- 

Me= Gly Gly ^ ^ ^ ^ 220 

— 5 :30 ' ^ a Val Leu Asp Val Ser 

His Pre Asp :i e C ys Asd ?h^ ^ 235 240 

MS * '"^ " YS Ala GllJ 5er Pro Ser 



Leu ?ro Hls Asn leu ^ 

260 

Ala Val Glu Arg Anr. Glv Lei: u . 



2 60 uiy /al Thr A - S P Ala Phe Leu 

6 5 _ ^ 



"S - ;- e ' — *-S -cu Val As:: Pro Arg Thr 

Gly Lys He Val Ala Ars Me . D „ ,. _ 285 

- 9 C " 1 ^ a Phe Asp Aia U e 

C - /s L ^ Aia Ala His a"., n:.. „ . 300 



='/s Lys Aia Ala His Ala Gi 



3 05 



313 ^ P ?r ° LeU Va: Leu Asp 



"~ A5n ^ AU Pro Val Pro GW ^! r„ a , 320 

32 5 , ' ^ Gi > ^le Glu Ala 

""^ A<— ~, 3 30 

" ^ y Glu Va: Pro Leu Leu Pro ^ G ,„ q ^ 

34 0 G — Ser Cvs Asn 
* ,i - , J4; . ^ „ 
ser Ar.rs . 



353 A ^ 7^ M2 = Asp 31y Arg Val Asp 

nGc r^rg Lou — , 355 

•to - a - -'- a 3: '-' Vai a: a 7a. A -g . , 

-eu .-LSD 

Val :i e ^ p val Ser a.^ ^ =■ 3 

235 390 ' ?rC ^ 31-, .Ala 

--a Arg Ala Thr Arg Lvs -„ • „, 400 

" " - ^CU J IV Vri ' Mpr 

405 * n^a Glu 

— J --eu Aia Leu Gly :.;.-» - Jr3 . ^, 

1J 3 ^ — l -ii^J ;;: r — ^ 

Ar 7 -Ma v.- . _ .. . ^ 

^ ?r; Ala Phe Thr A^p 

42 : -47-, As '" A;i -a: Thr 3«r 

Vl - - d ? " --^ 31 v 4?5 430 

48? 
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(D) TOPOLOGY: linear 
(XI) SEQUENCE DESCRIPTION: SEQ ID NO : 7 1 : 

Qly val lie Val , eu ^ Leu Glu ?rQ ^ Qly ^ ^ ^ ^ ^ 
Ue ryr Trp Ar 3 ^ ^ Qly Leu w- ^ fiiy ne a ^ £ ^ 
Val CUy n e AIa VaJ Ma ne » ^ ^ ^ 3^ ^ ^ 

Gly Ala Lya Pro Vai ser ^ " p Lyg ^ ^ ^ ^ ^ ^ 

P» Gly Ser Pro Ala ^ 0 31a ^ prQ ^ ^ £ ^ ^ ^ ^ 

Cly *sn il d Ala Ala Aia ?» Pro 3 u s , 7 a „ , 80 

ro J " • Gi / Asn Pro Glu Thr Pro 

Thr Pro Th- a- , - , ^ 
- - J . :: ?rn ., ; 

, ^ Asp 

Asp Cys Pro Asn Ser Thr Leu Vp • - "° 

115 ' ; /a * " Thr Asn Ala Pro 

r ^ ■■ „ ^ „ 1~ 0 , - ? ^ 

~" ^30 '"^ G1> ' AS? ^ P « **» M- vll val Thr Asn 

lie Gly Leu Vai Ser Cys Lys Arg A.o Vai Gly Ala" Ala Val Leu- Ala 

Ala Tyr Val Tv- Se- T *-*m v„ ^ 1:35 16 c 

* " 0 5 ? " /3 ^ g LeU T ^ 3 ^ Asn Asp 



170 



Fys Ala Pro Ser Asn Glu ~^>- t^m v.' r 

133 *"* " eU ~>' 3 Tnr ? ^ Ser Pro Gly Glu 

Val Thr Th- Ala -a 1 ^ 190 

i9b ' **" G1 *'' G1 Y Ser Ala Pro Arg 

"vs d-^ - „ 230 205 
eu ?*--. ,, rcT ?r . : , a d _ _ ^ ^ 

Vai Leu Jlv A^ n Leu V~ o - ^ 

r • - - j-eu Pre /a. Pre 



p ^ ^ J -c Pro ?rz Gly Pr; 
2-15 

— ^^u jor Pro Al. 

260 

:::fcrm^t:^; JEC :r ::c 



V-hI Pro A. a p ro r-i 



?ro Gly Pro Al. 



"£Ap.a;;thr:: 



3 l an:r.o acrid 

tgpci.:gv. :i . ear 
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^ Val Asp Asp at, Pro lie Asn Ser Ala Asp Ala Leu Val Ala Ala 
Val Arg Ser L ys Ala Pro oly Ala Thr Val Ala H Th r Phe Gla ^ 
Pro Ser Gly Gly 5 Arg Thr Val QIa Vdl £ Leu Giy Lys Ala H 
Gin 90 95 

2) INFORMATION FOR SEQ ID NO: 73; 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 364 amino acids 
(3) TYPE: amino acid 
(C; 3 7RANE EDNES S : single 
TOPOLOGY ; linear 

;xi) SEQUENCE DESCRIPTION- C£q - d ■ ^ 

^ ^ ''^ f Cr " eU ^ ^* -V Thr Leu Val Leu rhr A.a 

13 1 0 

Cvs Glv Gl'' G 1 " a<rr , c 15 

7 n b " Se ^ A* a Giy Gly Thr 
2 5 

Gly Ser Val His C^- ^i,, . T 30 

35 ^ ^ r * 7 ^ S ^ s Glu Leu His Ser Ser Glv Ser 

Thr Ala Gin Glu Asn Ala Me- „ , 45 

.0 - J *" PhG V ^ Tyr Ala Tvr Val Arq 

J J so 
^er ^ys Pro Glv t^,. , 

■S3 ■ :r ^ " bp Asn Ala Asn Gly Ser Gly Ma 

- - 7 5 

■j-v Val Thr G 1 *"< - , > „ 8 0 

9 ;* S ^ ^ rtSn 3 ^ T - ^ Ala Glv Ser Asd 

• a - ?ro __,eu Msn Pre „ ^ ^ 

"~ JA ' ^ n Arg Ser Ala Glu Arc 

-/-" Ser ?r:.; Al i 



Aal Phe Glv 'J^- 



Thr Tyr Asn ^e \ vs G'-- ^ ^ , " S 

130 ,i " ^ eU Asn Leu A&p Gly Pre 

fc - rtia -ys ?h« ash si-- - a , m 

145 , ' - - hr /A1 Tr ? Asn Asd \- 

^i- xxe jlr. Ala Leu ; 3r ^e- -u- ^ - 

--er Val 11- p nti ,^ , . . ; ^ - . ' 

:bc " * - ;t ' r Asp Asr; P:; ^ 

_.v f - . " J D 1 yc 1 

19 5 V ' * " ^ " w -'' ' ' " /n ^ ^ V A^a Ser 

■ -..„.., — " ^ c; 

212 ~ 31v Ala ^n Asn Glv 

Thr Ser Ala Leu leu 31 :: Gr. r -u^ - , . 

^ ^ ^ * ^ ^ ~ * ^ ^ - . » r T"yr As n. Glu 
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290 

305 310 



2 Q q 

^ 300 



Val Leu Ala Thr ^ 31u va: Oy s Ser Lys ^ Pro tep Tar 

Thr Cly Thr Ma Va: Arg ^ Phs Mec Qia £ Ma ^ q ^ »0 

33 0 

Gin Glu Gly Leu Asd Sin Glv ^ n 335 

340 ' * la Gly Ser ..e Pro Leu Pro Lys Ser Phe 

Gin Ala Lys Leu Ala Ala Ala Val Asn Ala He Ser ^ 
355 360 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3 09 ammo acids 
(3) TYPE : amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

',xi, SEQUENCE INSCRIPTION: ZEQ ZD ^C-4; 

^ T y ^ Vai ^ ^ Thr ^ H,s Ala Glu Asp 

10 i - 

-^n .nr His Gin Aso Arg Leu H^s "-s — n n 

20 ' ' ° c, / !i A *s Arg Ala ^ a Val 

- -a. Arg 31, As, Arg ,.a Ser Val Ser Ala Thr Ser Ala Arg Pro 

4 5 



Pro Arg Arg His Pro AJ a g<- H — -v , 

50 ! ?Vr ? ^ Arg Val Ala Pro Ser 

^rg Arg Aru. Pro His Pro H 1 3 »^ ~ , _ 0 

55 ?Q ^ ^- ri Asp Asp Arg 

Arg Asp Ara Pre Ala L^u Leu Asc Ara g' r 3 _ ^ u 80 

u . — j-i. y.: JiU u :3 ;j ro 



95 

Asp Pro His Ara Arg Slv 
100 



95 



Aia A ^ Arg Va: Arg Gly Arg 



^ ^ ^ Asp Asp Sly Arg Leu 



110 



12 0 



Pro Asp Arc Asp 



Al.i Asp His Glv Ala Pro Val ^g y ,. ^ ^ 

130 " J ^ — ^ Arg Gly Val 

^ " " '40 

3 - r. His Ar a G 1 v G 1 ' ' p *- " , ■ D ^ - - , 

r '** h •»* ~™ A " Va: Pr: Slv Val Arg 

i Ar~ ^ . . . . .' " : "> ■■ 

' •• ' .td ::y r, v 

' ^ A: ; \: ■ - - A ^ ? y v^ a:., 

;a • SI- Asn ^eu H " z A— -1.. - .. . 

'"^ ^' ^ ^ A. a Asc Gly Ara V ti : 

20S 

2 ^ d "'-'^ '' ro Ar? Arg Jer Arg 



?: - A " ?rc Arg Arg Glv Ala : 

Ar-j Leu Hi. ; 



A-^ S-y Pr= S-r ?: 
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275 2SO 



-a Gly Val Ma HiS ^ ^ Qly ^ ^ ^ £ ^ ^ 

300 



Asn Arg Pro Arg Ara 
305 



(2) INFORMATION FOR SEQ ID NO: 75: 

U) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 580 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNES S : single 
CD) TOPOLOGY: linear 

(XI) SEQUENCE DESCRIPTION: SEQ ID NO:7 5; 

Ser Ala Val Trp Cvs Leu Asn civ ?>- -v~ 



<rg Hi 3 Arg H; 



Arg Cys ^ v ^ Arg Ala ser Glv ^ Ir^ Sp^ ^ ae , 15 

-«q ~ ^ er Asn Arg Trr; Cvs 

" 3 f ^ CyS AIJ S « ^ Thr Pre T- Gin Ala Ala 

Ser Pro ,. u Glu ^ ^ p , e ^ ^ ^ r ^ ^ ^ ^ 

Arg Phe Arg 3er Phe y^r Ar-r . °° 

65 g Ar? " eu Ma Leu G1 V Ala Arg Thr 

Ser Arg Thr L«u C v v^' * r- > ' 5 30 

u Ar- Thr Leu S^- G ■ - a „„ - 

- Asn l,c?u Ser 

90 

Pro Ara Ala Gin s e >- - . r ^ 95 

, n „ "° ^1 Thr Val Glu Ser His Thr s 



90 

Ser Cys Ala Val 

Ala Ser ^ro Ar~ Mer Via - -<= ^ . 

. y s .eu Arg Va. Val Glv Leu Vai 

~ ~ - 2.20 - _ „ 

j^u Glu 3':: - r r Ser ,\sd ~ ~ 



5 110 



i'ro Gin 31:: Pro J'- >- 



^ro Arg Tyr Ser Pr; 
140 



145 ' - -7 " /r '"^ n 3 * r - Gln 21:: Thr 

- 3 0 — ^ 

^ Ka ^ *rp Ar 3 Tyr Pro Pro Ser Pro Pro Pro 

Pro Thr 31:: Ar- - ^ - * 175 

, 1 . . j » 

- -.- - eu ~ ■ ..... . ^ * ^ J * 30 

' * yr3 * Jr '-' Pro :;: v 

n: J "r: 1 " v y ^ z j -/ vai 

A_i Va. Va. Ser Ala 31v I'- V -. r .. - T 7 V ' . . 
:C5 , 3 „, " " - q ^ a ^cr Leu Val 

A - 1 J -' -- r - : Pr= Vai Ala Ala 
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305 3 , 0 

Pro Leu Gly ser ? : 3 Pro Pro Lys , hr val Thr phe ser »0 

GlX Arg Thr Ala Pro Phe Thr Va , ^ "J ^ ^ ^ ^ 335 ^ 

no Val Val ^ Val Gln Qly £ ^ Giy ^ ^ 350 ^ ^ 

360 365 

370 ' "I " y uin Fro Leu Ala He 

J/b 380 



teu Cly Ser Ser Ser As? ^ ^ Val Qly Qla ^ J« ^ ^ 

«y Ser r- ro Leu Gly ,e ; ^ Cly Thr Val Thr t" Gly IIe Val Ser 
-a Leu As, ^ p„ Val ser Thr ^ oiy "J Ma «oo 

Thr " U 5 ^ :le Ma Ue Asn Pro £ ^ 

Ser „ y «, Ala Leu val ^ ^ LI Ala 513 Leu , a , « y val ^ 

Tso " S ^ "' r ^ U ^ ^ 3 *»P Ser Ala Asp £f Gla Ser GIv 

Sfl r ■!„ r „, 460 

=>^- ^_e Kj+y Leu G.v oho a " r, r; a r,».„ ., - , 

465 ' A " e ?r= Asp Glr. Ala Lys Arg lie 

a: -t Asp giu Leu s »- tv,- r! „ , " , "' J 4 80 

" LyS Ser His Ala Ser Leu GIv 

Val 5lr. Val Thr Asn Asu Lvr, Asn ~v>r a-, ~> 495 

50C - " S? = Lys lie Vai Glu 

Val Val Ala GIv Giy Ala Ala Ala Ln Ala GIv Val Pro Lvs GIv Vai 

-a- Val T*- Lv "i" « - - 5 ^ 5 

533 " ' "* "'" ? ?r ° Asfl Ser Ala Asp Ala Leu 

Val Ala Ala Val Ar g Ser Ly^ Ala Pro Gly Ala Sr Val Ala Leu Thr 

?he Asp ?rc Ser Giv Jy 3., 560 

' -■' °" - hr Sin Vai Thr Leu GIv 

- ° ~ ' 7 O _ 

- : -a 31u lln ^5 

:^FCRMATICN FOR 3EQ :j :j 0 :76: 
- 1 ^HQlTENCZ CHARACTER 1 3T I CC ■ 
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70 75 80 

Leu Gly He Thr 5er Ala Asp Val Asp Val Arg Ala Asn Pro Leu Ala 

95 90 95 

Ala Lys Gly Val Cys Thr Tyr Asn Asp Glu Gin Glv Val Pro Phe Arg 

100 105 110 

Val Gin Gly Asp Asn He Ser Val Lys Leu Phe Asp Asp Trp Ser Asn 

lx 5 120 125 

Leu Gly Ser lie Ser Glu Leu Ser Thr Ser Arg Val Leu Asp Pro Ala 

130 135 140 

Ala Gly Val Thr Gin Leu Leu Ser Gly Val Thr Asn Leu Gin Ala Gin 
145 150 155 160 

Gly Thr Glu Val lie Asp Gly lie Ser Thr Thr Lys He Thr Gly Thr 

165 170 175 

He Pro Ala Ser Ser Val Lys Me- Leu Asp Pro Gly Ala Lvs Ser Ala 

180 185 190 

Arg Pro Ala Thr Val Trp He A. a Gin Asp Gly Ser Ells His Leu Val 

195 -00 205 

A " ^ ~ s ? -ly -er Gly Ser lie Gin Leu Thr 3ln Ser 

Lys Trp Asn Glu Pro Val Asn Val Asp 

2-5 230 

INFORMATION FOR SEQ ID NO: _T ^ : 

i. SEQUENCE CHARACTERISTICS : 

(A; LENGTH; 56 amino acids 
13) TYPE: amino acid 
:c STRAND EDNESS : single 
(D) TOPOLOGY . linear 

xi, SEQUENCE DESCRIPTION: SEQ 12 NO: 77: 

,^a Glu Ala Ya. Gin Arq Ala Arg Asp Ser Val Asp Asp He Arg 7a. 

25 30 

nia Arg Va - :le alu G — A ^P Met: Ala Val Asd Ser Ala Gly Lvs '1- 
-*' r '-'' r '^ r ? -V" Leu Slu Val Ser Phe Lvs Me: Arc; Pro Ala SI:: 



:;r -jrmat: j:; -y;~ ses. :e no : ^ . 

S EQUENC E CHARACTER I ST I C S 
A LENGTH: 5 9 amino acids 
. B' TYPE: arr.mo acid 

ST HANDEL NE S3: i r.cj * o 

■ T" r r 7L0GV 1 ; near 
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Cys Ala Ser Pro Pro Ser Pro prQ ^ ?r= _ ^ ^ ^ ^ ^ 
Pro Glv Pro Pro Met Pro Pro Leu L'p Pro Trp Pro Pro aL Pro Pro 
Leu Pro Tyr Ser Thr ?ro ?r o Gly Ala Pro Leu Pro Pro Ser Pro Pro 

Ser Pro Pro Leu Pro 
65 

(2) INFORMATION FO* SEQ ID NO. 79; 

CD SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 355 amino acids 

(B) TYPE: amino acid 

(C) 5TRANDEDNESS : single 
CD) TOPOLOGY : linear 

SEQUENCE Zi^'SRZP-ZQX: 3 HQ ID :JC : " 0 : 
Mer_ Ser Asn Ser Ara A-a a— ^ ^ ^ 

: ' ^ A - ^ er ^ -r? Ser Trp Leu Leu ser 

1 C 

Val Leu Ala Aia Va 1 P 1 - t 01 , o 1m , , 1 ^ 

2Q ^> ^ la Thr Ma Pro Ala Gin Ala 

^ 1 *~ ^ 3 0 

— Pro Pr3 Ala Leu Ser Gin Asp Arg Phe Aia Asp Phe Pro Ala Leu 

40 45 

Pro Leu Asp Pro Ser Ala vu..- v V 

50 -' a - Ma ?ro G1 - v ^l Val 
Asn He A^r - ^, — ^ 

65 * *>' r Asn Asn Ala Val Gly Ala Glv Thr 

Gly He Val He Aso = >-n , T ' " 80 

^ --o Aj u _ y /dI val Leu Thr Asn Asn His Val 



35 

- ^sp He Asn Ala ?he Ser 7ai Glv Ser Glv 



Aj. a ji V m I a 

: 0 0 

ryr Glv Val 



115 ' : ^ st: "hr Gin Acp Vu^ Ala 

Val Leu Gin Leu Ar 7 Jly Ha ^ r .. ^ ^ . , _ _ 

ijq ,■ .... jer mU. a j^xa G'V 

ji, 7ax ^ia Vai jlv — d_ • - - - 

145 . . 1 " Ja - " et Gly Asn Ser Glv 



wa Va^ 



\r7 Vai Va* 



Pre Val 



1 1 7 a i G I v Me z 



213 - ^ u ^ n Lhr 

Ala Aia Ser Asp As:: y r .e Jin Leu So^ G ~ 3 . . . 

liS w --r Pne A^a 
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Gly Ser Ala Pro Ala Ala Se- ■ 

290 ^ Ue Ser Thr Val I ie 

iS vu 3 - £ ; - - «» «. £ «. «. «, ^ 

Ala Leu Asn Gly His His Pro Cl y Asp val n" Ser val Asn Trp g" 

Thr Lys Ser Gly Gly Thr Arg Thr Gly 2„ Val th. t 

y Asn Val T ^r Leu Ala Glu Glv 

3 4 5 

Pro Pro Ala 350 

355 

(2) INFORMATION FOR SEQ ID NO : 8 0 : 

<if SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 205 ammo acids 

i3) TYPE: amino acid 
* ;C' AJ^ETV^Si; L r - "a 

::i TOPOLOGY: linear 

SECrjEKCS ASCRIPTION: SEQ ID NO: 30: 

T r ^ ^ ^ ^ P G ^ ^ Gly val Pro Val Ser prQ Thr 

Ala Ser Asn ^"-^ ^ - t t„,, * 15 

" CU ^ ^ ^ ^5 Glr. Ser Leu Asp Ala 



" ""'* is'" **" Hld 7d: AU Val AX 3 Thr Thr Glv Ly, 

a^ Ass Ser L^- L^u G 1 - , ^ 

5 0 ^' I!*" AS ? 7al As P Val Arq Ala 

^ b 60 



Asn Pro Leu Ala Ala 

* n 

^5 



-V Val Cys Thr Tyr Asn Asd Glu 



35 ;c ' ~" *^ - -^u Gin 

Val Pro ?he Ar^ v a i 5 30 



j D As 



Mr 7a - -v- -eu ?he As D 



ioc :;: J " - e - " er - hr ^ v a ; 



7i Tj 



rc rt.^.a A^a ~jI ' r Vj ^ - 

us " - eu - eu ^ r v a i -r ^ n 

Ala Gin Glv ^- - . , " 5 



- D 

Ser 



Val A:v; a: i >- ., - .. ■ ... 

, a , * jt -- - j^v Ser 

, 1 Qn 

, "" >:: lr: Va - Aan Val Asp 



WO W4207G 



10 



(X1J SEQUENCE DESCRIPTION: 5EQ ID NO:81: 
Gly Asd Ser Ph* 3 Tm i 3 1 ^ ^ i * , 

x Irp ^ a ^ a ^ a Gin Men Ala Arg Gly Phe Val 

If ^ G1 >' ^ If Tnr Gly Glu sly 11 Gla 

Hi. Ala Asp Gly Hi. Ser Leu Leu Leu Asp Ala Thr Asn Pro Ala Val 

4 0 

val Ala Tyr Asp Pro Ala Phe Ala Tyr Glu lie Gly Z r le Xa , Glu 
Ser Gly Leu Ala ^ „ Bt ^ Gly ^ ^ ^ ^ ^ ^ ^ ^ 



^ Val ^ A - <^ *o ^ Val Gin Pro Pro Glu Pro Glu 

Asn Phe Asp Pro Glu Glv V ri i L eu r'v n., - - 95 

/n. Leu Giy Gly tie Tvr Arg Tvr H 1 s a^ a 



;oo 10B 



~n --^rg Thr As:i lys Xaa Gin 
20 



Leu Ala Ser Glv 

Ala Mer Pro A^ 1 V i t ~, . *\ ^ 1 

- e ~ nia A- a Gin Mec Leu Ala Ala Glu 



* ^ . . , , 140 



:3 : 

■ Val Ala Ala Asp Val Trp Se: Val Thr Ser Tro r-- n„ - 
14 5 1 r ? Glu ^eu Asn 

^55 

Arg A^p Glv Val Val * - ] t* ^h,- n ~ 160 

^ 1 — o.u .hi Glu Lys Leu Arg His Pro Asc Arg 

~ b ~ 170 M 

Pro Ala Glv Val Pro ^1 ~w * C,' 175 

" — Arg Ala Leu Glu Asn Ala Arg Gly 



180 
Ala Val 



135 



190 



195 



er Ago Tr ? Arg Ala Val Pro Glu 



Arg Pro Trp Val Pro 



20 



205 



i^eu Thr Leu Gly Thr Asp Glv Phe 

::5 ^ a -^ rCT -"-g -*/r Phe Asn Thr Asp 

Ma Glu Jer Glu Va ' * " • - • - - . . ^ w ^> 
14 ^ 

My Ala Glv Ar; 



250 ^ 

Asu Pr" Ph^ 

Ala 



270 



-eu Pro G^v Phe Asu Gl- -., * 

'■ - -eu ,vrg Pro Xaa v V s 



GBP 



1 S ^ 
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Asn His Ala Met Met Leu V^ ^^ 45 

5 0 ^ Gln ^ ^ Asp Arg Asp Leu Arg 

Val Glu Il« d-o — v v., . &0 

■ " * ' ASP Thr Val Asn Gin P he ^ ?ro 

Arg Glu Ala Leu \la Lpu ai r 75 30 

A- Leu Ala Leu Asp Gin Glu Arg Thr Val Thr ^ 

Gin Val Gly Arg Leu Thr Ala Va^ Ala Ira * ^ 95 

100 a ^ ^ P Glu G1 y Asp Phe Leu 

Gly Glu Gin Phe Met G'n r>u 1°* 110 

115 ^ ^ ^ LeU Gi * ^ Gin lie Glu Glu Val 

Ala Leu Met Ala Thr Leu Val Ar~ Va i m * 125 

130 ^ ^ Val Ala Arg Ala Gly Ala Asn 

Leu Phe Glu Leu Glu Asn i m 14 ° 

14S ^ * he Val ^ Glu Val Asp Val Ala Pro 

Ala Ala Ser Gly Ala Pro Hl3 Ala M- - ^ 160 

° ^ a ^ a Gly Arg Leu 

~* " 170 

INFORMATION ECE 3EQ 10 \ T 0 : 8 J . 
(i: 1 SEQUENCE CHARACTERISTICS- 

(3) TYPE: ammo acid 
(C) STRA17DEDNESS : single 
TOPOLOGY : linear 

X:J SEQUENCE INSCRIPTION s'v~ — . 



Arg Ala Asp Glu Ara Lv; 



^ Thr Thr Mec Lys Met Val Lys Ser U e 



Ala Ala GIv „eu "^>- — ^ ~ C 15 

. — - ,,_a rt^a A^a ^^, r . ^ 

-0 ^ A^a Ala Jly 

Thr .Ser 1 1 ^ v e - ■ ^ • , , ^ ^ ^ 

1~ • ^" " Jrz: 7a - Tyr .-'In Met. Jin ?ro 

"■'al Val ?he Glv Ala ** 5 

5C „^ - ~ eu Xaa Ser Ala Pro ;.; aa 

Val ?ro Thr Ala a' a ^* - -n 60 

" r - Caa *- c - ^eu Asn Xaa Leu Xaa Asd 

Pro Asn Val Ser Phe Xaa As- ' — ^ . , 3C 

- v "j- '■' Xaa " I 1 • r ■ ■ ■■■ i ■ ■ ' ' 

.•.I •" •'• Ja Xaa Zlr. 



TYPE : annc aoid 
-STRANEEOriESS : ji-qI^ 
--rC~C~Y- linear 
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20 25 
Arg Arg A^a Leu Glu Leu Gin ,v a P-o 

wal Lys Glu Pro Leu Xaa Thr Glv 

50 55 ' — 

Pro He Gly Arg Gly Gin Arg Gin Leu 
6 * 70 
Gly Lys Asu Arg Arg Leu Cys Arg TU- 
BS 

Glu Leu Gly Val Arg Trp IIe prQ 

100 105 
Val Gly His Arg Ala Arg Arg Gly Thr 

115 120 
:) INFORMATION FOR SHQ ID NO : 3 5 : 

' ~ ) sequence characteristics • 

X-L . amino acids 
>3) TYPE: ammo acid 
:C! STRAND EDNESS : single 
<S1 TGPOLOGY: linear 



Ser 

Lys 

He 

Pro 
90 
Ser 

Tyr 



Val 

Ala 

lie 
75 
Ser 

Arg 

His 



30 

Val Xaa Arg Gin 
45 

He Asp Ala Mec 

SO 

Gly Asp Arg Lys 

Ser Asn Gin Arg 
95 

Cys Ala cys Val 
110 

Arg Arg 

125 



Thr 

Thr 

80 

Glu 

Tyr 



(xi) SEQUENCE DESCRIPTION: £EQ ; D NQ 

-vs Asp Ala Val Men Gly Phc Leu G^v 
5 

Val Asp Gin Gin Leu Val Thr Arcr Val 

^r. Ala Ala Ala Val Pro Val Va^ 

^ ^ ? Leu Glu :i, Lys ,ua Gly 

^ V 31 V 7al ^ V Me ~ -la 

31/ Vrl1 Va - ?he Val Thr Ala 3 — 

95 

Arg Al * Xaa Xaa ?he Asd Ast: Xaa 

ioo * , 

.-^g oer ^er Xaa Glv 

115 



35 : 

Gly 

10 

Pro 



Ala 

Gin 



Leu Th 



j^u Ser 



90 



Gly Pro Leu Ala 
15 

olv Trp Ser Phe 
30 

ALa Trp Tyr Gly 
-4 5 

"al Leu i i a His 
") G 

l^u A. a Arg Gin 

: "rp Asp Thr 
95 

X a 4 ?h e Pro Hi g 
110 



Val 
Ala 
Leu 
Al a 



3C 
Leu 



Xaa 




HO 94/42076 
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115 

Thr Ala Pro Vai Pro Ass Tyr Tv~ ^ ~_ ^ 

35 ^ ^ *™ oly Gin Pro Phe Asd 

Pro Ala Tn> Glv p- D A — t>_ , 45 

n Tr P Asp Pro Tv- ^h- r> 

55 ~° A/r ^ °y s His Asp Acd Phe 

*» - s " »> «r p,o „, „ ls s „ ^ l° p ^ t „ «... p . o 
115 Leu Mu ,T " ,; Le " •* - ^ «• - «. ? „ £, 

Pro Ala Ala Gly Gly Gly Ala ^ 25 

100 

(2) INFORMATION FOR S E C ID NO : 8 7 .- 

fi) SEQUENCE CHARACTERISTICS ■ 

(A) LENGTH: 38 ammo Icids 
^ rrPE: am ino acid 
C) STRANDEDNESS : smcrU. 
•UJ TOPOLOGY; linear 

^ } 3HCUENCE ^CRIPTTON: SEQ ID NO : 3 - 

Val Gin cys Arg Val Trp r^u Glu G , „ _ , 

1 5 Ar S Gly Met: Leu Glv 

Ala Asp Gin Ala Arc; Ala G^ <- v ^ ^° 15 

20 - ° X/ f ° A ^ He Trp Arg Glu His 

Ser Mec Ala Ala Met ' v 5 ?~ ^ 30 

35 '° ^ Thr G1 - / As P Gly Pro Leu Glu Ala 



'/s Glu Gly Arg Glv - - 45 



Val Met Ar q Val Pro Leu Glu Gly Gly 



-Arg Leu Val Va ' S' -I., -v, - 60 
° 5 "n" AGp Glu Ala A1 a Ala Leu Gly 

AS? ^ ^ - ' ^ Thr 

:^-hry:;= : ; ?GH sk; :;C:33 

1 SEQUENCE CHARACTERISTICS 

A) LENGTH: * 5 aminc'Icid^ 
3) TYPE, ammo acid 
C TRANDEDNE 3 S ■ 




WO 99/420*^ 
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I 14 



90 95 



:) INFORMATION FOR SEC ID NO:89 ; 

(ii SEQUENCE CHARACTERISTICS ■ 

(A) LENGTH: 166 anuno acids 
(3) TYPE: ammo acad 
CC) STRAND EDNESS : singj e 
(D) TOPOLOGY: linear 

(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

Met Thr Gin Ser Gin Thr Val rh- Va ' Ac-n ~ ^ 

1 5 Va ^ ^ '^ In Glu lie Leu Asn 

Aurg Ala Asn Glu Val r 1 !! au n „ 10 15 

2Q G.u Ala Pro Met A! a ^ p Pro PrQ ^ ^ ^ 

?ro :: e Thr ^ - 25 30 

3 5 -" r :<aa Xa * -7S 'Asa Ala Ala Gin Glr 

Xaa Va' -- . 40 -45 

- .■>__ rtj. a ,isp Acn Me' Ar~ "!■■ - 

50 55 -' r -™ A^a Ala 31 v Ala 

Lys Glu Ar 3 Glr. Arg Leu Ala — ,„ f° 

^ 01, Cl-a Val Asp 31a slu ^ ^ ^ £ ^ ^ ^ ^ 80 

J Q A ^ 



^ Glr: Ala Glu Ser Ala G 



90 95 



LOO 



105 



ly Al,i Val Gly Gly Asp Set 



110 



Ser Ala Glu Leu Th- ^ n D 

115 5 Aaa Thr Ala G ly Glu Pro 

As n ?he Me* - A*~^ ~ p> • .,, " ^25 

130 " A * a A ~ Glu Thr Gly Asp 

14S "" J — Asp Trp As, Thr Xaa Thr 

Thr „eu 31n .Jlv ,\sc 16c 
lc 5 



NFORMAT I FOR 



SEQUENCE L*K7vRACTERI3Tlc: 
A ^NGTH: 5 ammo Ic: 

TYPE : im:r.: *c : : j 
^ ~T?ANDEi:NE;;s . 




WO 99/420"6 



II: 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

Val Ala xrp Met Ser Val Thr Ala Gly Gin Ala Glu Leu Thx Ala Ala 
Gin val Ax 3 Val Ala Ala Ala Ala Tyr G "u Thr Ma Tyx Gly Leu Thr 



25 



Val Pro Pro Pro Val lie Ala Glu Asn A,g Ala Clu Leu Z Ile teu 
lie Ala Thr Asr. Leu Leu " n ^ Thr prQ ^ a ^ ^ ^ ^ 

Glu Ala Glu ryr Gly Glu Me C Trp Ala Gin Asp "a Ala Ala Met 

5 

la Thr Leu Leu Pro Phe 



55 7Q ^ ' A£ »P ^ a Ma Met Phe 

Gly Tyr Ala Ala Ala Thr Ala Thr Ala Thr lu ^ 



85 



Glu oi_u Axa Pro Glu Met Thr Se- -i^ riv 

A * u ^-Ld Gly Gly Leu Leu Glu a 

^ u ij 

Ala Ala Val Glu Glu A. a Ser As 



105 



_ 5 lr AZa .Ala Asn Gin Leu Met 



ASR £j Pr ° Q1 * ^ ^ is Gin Leu Ala Gin Pro Thr Gi., Glv 

Thr Thr Pro Ser Ser Ly S Leu Glv Glv Leu Trp Lvs Thr Val Ser Pro 

I'D U "ICC 

His Arq Ser Pro 3 p^ ^ ^ , n ^ 160 

Asn 7a - Met Ala Asn Asn His Met 

Ser Met a^p , 175 

lie '"" " r ^5 Thr ASn — Ser Ser „ ec 

Leu Ly S Glv Phe Ala Pro Ala Ala £l Ala Gin Ala Val Gin Thr Ala 

/a - Ar^ A". .i Me: Ser Ser 



^ - a Gin As n 

Ser^e 

ie2 " " V ^ " ^ - 31v Ar 3 Al, 



^ : ' " "" lp J - v -ys Tvr Ala Xaa S**- 

5 J 5 U 

Arg Arg Asn oly Glv Pro .Ala 

.... . r . jr; ^3 :ic.3: 

- TYPE arr.ir.o icia 



- 4 ^ 



WO 99/42076 

116 



P( 1 I VJ9 OMUH 



50 q _ 

60 



l« «p „, u . v . : s „ My £ Th _„ 3iy Lm 

™ «y .1 Al. u, u. ». „, u, l. u IL „ y val w „ - 

90 

p» ^ 1. J.J. te B . Ihr v>1 „. vaI Leu Gly » ^ 

«, ». P.. L « u M « e wl ,„ ^ ^ U. ^ ^ 

- £ «, ^ ^ ^ Vli Ujl Leu „ a ^ ». ^ 

Ala VaI Ala Ala Val Leu ai, , r 140 

14 5 ^ Wa LCU L « u Va l G!u Thr Gly Ala lie Thr 

Ala Pro Ala Pro Arc: a-o w„ D u„ . „ 15S 160 
~o _,s Phe Asp Pro ^ Gly Gin ^ ^ ^ 

Tyr Gly Gl:i Tyr Glv Gin ^ v v „ , "° n "5 

180 ' ' ^ j1r Pro G1 '/ Gly Tyr Tyr Gly 

~~' J ~ a Aia A1 * I-eu Gin Ser Pro Glv ^ 0 



*~n Ser Pro sin o — p~ ^ , 

210 " " *~ ^ 3 ^ Si" Gly Gly Tyr Ser 

" n 220 



Ser Scr Pro Ser g 1 - r^r c ^ 220 

225 23Q " G ^ T>- Th: Mi3 g1r Pro Pro ^ 

31r. Pro Pro Ala Glr. 5»- Glv c; 0 >- r^^ ^ 235 240 
, 4 „ ' ^ Gla Gln Ser His Gin Gly Pr0 Ser 

Pro Pro Thr Gly ?he Pro Ser Dh e Se . o« p B 2S = 

260 Pr ° ?ro ?r = Val Ser 

Ala Gly Thr Glv So- 3^ - -, ~, . 270 

275 " ~ ^ ;t> bSr ^ Vai As:: Tyr Ser Asn 

' ° ber -r. Ser Ser Se>- P>- 3 — ^ T 

290 , ^° — / ^7 Ala Pro Val 

300 

; A} 1H - ;GTH; :s acids 
(R' TYPE : amino acid 

.1T2AKDEDNESS : single 



WO 99/42076 



rCT/rS9 l >/03268 



in 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 94 : 
Asp Gin Val Glu Se- T*r n ^ 

1 ' ^ ^ a Gly Ser teu G ^ Gly Gin Trp Arg Gly 



io 15 



(2) INFORMATION FOR SEQ ID NO: 95; 

(i) SEQUENCE CHARACTERISTICS- 

fA> LENGTH ; 2 7 amino acids 
(B: TVPE; amino acid 
(C) STRANDEDNESS : single 
CD) TOPOLOGY: Linear 

£xi) SEQU^CZ DESCRIPTION: SEQ I D NO : 95 ; 

3ly Cys Z1 V S ^ Thr Ala ~Iv ^ ^ _ _ 

1 r — — ; j-l;. *rp Arg ^lv Ala 

Ala - . r, ^ 10 is' 

nQ A^a ,ai /a ^ Arg 
J 5 

2: INFORMATION FOR SEQ ID NO:96: 

fi) SEQUENCE CHARACTERISTICS - 

(A) LENGTH: 27 amino ^acids 
<3) TYPE: ammo acid 
(C) STRANDEDNESS: single 
f") TOPOLOGY: linear 

SEQUENCE OESC^^~-^t ~ 

j*y 3^y Thr .Ala Al. - ■ , . , ^ . T . 

1 " ^ Ja - /a - <^S ?he sin Slu 

* *■ ^ 

A* a Ala Asn L / s "i* - ■ • — - • „ ■, , : ~ 



INFORMATION ^ 3E~ — vn — 

^ SEQUENCE CHARACTERISE ~ 
A.- LENGTH: E im^u 

- S TRAI ,t> ED NE S " 



-a asp. Lvs - - . , ( - - ■ „ , - 

- " ' J - u s:-a lie ser 



WO 99'420"6 
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(B) TYPE: ammo acid 

(O STRANDEONESS . single 

CD) TOPOLOGY, linear 

(Xi; SEQUENCE DESCRIPTION: SEQ ID NO: 98: 

«y Cys Gly x le Ar g Gln Ala Gly Val ^ ^ ^ ^ ^ ^ ^ 

Glu Gin Gin Gin Ala Leu Ser Ser Gin Mez Gly Phe " 

20 ?5 

(2; INFORMATION FOR SEQ ID WO: 99: 

;'l) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 507 oa se oairs 
■3) TYPE: nucleic acid 
(CI STRANDEDNESS : single 
O^ TOPOLOGY; linear 

SEQUENCE DESCRIPTION: SEQ IDN0:99- 

ATGAAGATGG TGAAATCGAT CGCrGCAGG- — 

^ - jU{blJ * ^*unC^^,CCG CGGCTGCAAT CGGCGC^rT 

~£^5 "™ ^f= K TATACCAGAT 

cact^S dS:£l :rf:f ccc CTO ^^ ^,0^= 

= J™ ~ i™ — ~ 

CGGCCACCGC CGAC3— a—™- ^f^ T C=AGCCGGCG 
ACGCAGAACS TCACGTTCG*" GAA m CAA~ ~~™r ^ GrtAGCTCTC GTCGCCGKSTC 
3AGT~GCTGC AGGCCGCAGG S^T*" ~™ ^ ^CACCCGC ATCSGCGATG 



:nfckmaticn for sec :l nc i ■ 

* SEQUENCE OHARACTERISTICS .- 

A) LENGTH : 153 ami- ici-:> 

B) TYPE; ammo acia 

■ c; STRANDEDNESS . single 
.0) TOPOLOGY, -in ear 



60 
120 
180 
240 
J00 
J60 
420 
480 
S0^ 



:Eo:—nc: 



IE, 



:'::r Am 



-J A:d Ala 



Am 



* . * A ^ a 



- : 1~ Me: 



-^r a . .1 si- 

30 



31 v Pre 



* 5 

V. -i 



-a^ 7a. 



i Pr : 



m 7a; 



;a Pre Leu Pre Leu 
45 

■ * ^" ~-?u Thr J«r 
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i 19 



:o5 



V ^ ^ ^< ^ P - ^ Ma Ala Gly Ser Ala £ Md ^ p 

Val Ser Vai Ser oly Pro Lys Leu Ser Ser Pro Val £ cla ^ Val 

I« V ^ ^ ^ ^ ^ ^ t Leu Ser ^ Ala S er Ma Mec 

Glu Leu Leu Gin Ala Ala Gly Asn ^ 160 

1S5 

(2) INFORMATION FOR SEQ ID NO: 101: 

;i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 500 base pairs 

(B) TYPE: nucleic acid 
(cj stra::dedness : jingle 

- .. . ; .mear 

x,, 3ECUE.VC" -SC?.:pr:OK: 3S C ID NO: 101: 

StSS SSaA IS~: g c :: S;: gcctcc — wc**** 

GGCTGGCGCA CAGTTC^CS CC^CC^ C ^CGGG 120 



5 0 



6C 



CGCACCGCCA CGTCAGCGCG -p-I--- IATTTGCGCA ATTTCCTCGC 

ACACTACA7C GGC" - G At™- - ZCZC!A ™ CAAGCTGTGC CGGGGGCGGC 

GGGC~=A~ — C7CCTGC AACAACTATT AAGCCCATGC 

ACGGGCCGCA : -™AC~ - 3GGGC ~ A ^ CCAGATTGCC CCGCTCCTCA 3S0 



b 00 



3C CGGGGCTAGG CCAGATTGCC CCGCTCCTCA 
GGGGCTA GGGCAGATTG 
CCACCG CGCTGGAGCT ~~ — ~™GG GATCCACTAG TTCTAGAGCG 

nformatic;; ?cp. sec is :;o : : r. ■ 

^EC r JE:;CE --lAi^ACTERISTIC" 
rt, — =.:;GT:i : ?6 arr.:::j ic-.ds 
.3; TYPE, amino acid 
C, oTR.V-ECIJESS . 3 :.ig:- 
iE; TOPOLOGY: linear 

-■•■■'< 5ecue:;c»- cesc^:tti2\*. 



a: 1 ... _ , ... . 

" • ; ^ 'V- At 1 Val ,M > 

A - A r, r. .1. 7;:r ,, r ..... . . . ^ 

' ' ■ " • ^ ^ A^r. Ala Ser 

- phM ~-a A- a Pro Pro Pro 



WO 49 42076 



PC T I S*;9 03268 



120 



fi! SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 154 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:103 : 



^JJfSS* mCGC?3GT "C^GCCCc; CGGCMCCOC AATCCAGGGA 

AAICjx CACGT CATTCAT^C fr^ r ^-Tv rr ,, r , 

n * L — c — .GAC omGvajGAAGC AGTCCCTGAC CAAGCTCGCA i-n 

GCGGCCTGGG GCGGTAGCGG TTCGGAAG CG TACC ^° 

154 



60 



11 INFORMATION FOR SEQ ID NO: 104: 

'.'i) SEQUENCE CHARACTERISTICS: 

■'A) LENGTH: 51 amino acids 

TYPE: amine acid 
c: S TRAND EDNE S Z . single 
'D; TOPOLOGY: linear 



"TENCH DESC? 



jEQ ID NO : 104 



MeL Thr Glu Gin Gin Trp Asa Phe Ala Gly He G lu Ala Ala Ala Ser 
5 

Ala He Gin Glv Asr ,; a ' ^'-v- „ ^ 

. .er _e M iS Ser Leu Leu AsD 31u 

i-/s j^n Ser Leu Thr L*'3 ' — - ^ - - . - - ^ 

- rt ~ a -rp -oiy Gu.y Ser Glv Ser 

3 5 * n 



\~a 

:0 



INFORMATION FOR SEC IT \\~ - 'iu . 

i SEQ UENCE CHARACTE P. I S T I C S : 

A; LENGTH. C32 bas^ -aire 

:B) TYPE, nucle- . 1c1 h 
I) STRANDEDNESS : s;nqle 

■D! TOPOLOGY: lined: 



45 



— jdTAACTI AG^ 

- - jN JGGGGGCGCC tac 
"GTGTGG TCTGNAGCCG GA.: 

TNE~ PMAT 1 'JN FOP - 

-E;r:ENCE C-IAEACT 



- - . i/\\j-o/Y. ^ ^ ^ .1/1/1 _ . 

- C _A ( jAAC atcctgacc 



" ■ .-utrt* — .ToA ig:; 

GAAGCGG TGCTCGACG~ "G 



121 

ixi) SEQUENCE DESCRIPTION" - SEQ I- NO: 106: 
3A7C3TACCC GTGC3AGTGC TCGGGCCGTT TGAG-jATGGA 3TGCACGTGT — CGTGAT 
3GCATACCCA GAGATGTTGG CGGCGGCGGC 7GACACCCTG CACAGCATCG GTGCTACCAC 
TGTGGCTAGC AATGCCGCTG CGGCGGCCCC GACGACTG3G GTGGTGCCCC CCGCTGCCGA 
TGAGGTGTCG GCGCTGACTG CGGCGCACTT CGCCGCACAT GCGGCGATGT ATCAG" CCGT 
GAGCGCTCGG GCTGCTGCCA TTCATGACCA GTTCGTGGCC ACCCTTCCCA GCAGCGCCAG 
CTCGTATGC3 3CCACT3AA3 TCGCCAATGC GGCGGCGGCC AGCTAAGCCA GGAACAGTC3 
GCACGAGAAA CCACGAGAAA TAGGGACAC3 TAATGGTGGA TTTC3GGGC3 TTACCACCGG 

ag^gaac— -gcgagga.^ :-acgggggg: 

AGA7G7GGGA GAG: 




oO 
120 
ISO 
240 
3 00 
360 
420 
4 8*C 



pcggccgc GTc-3Gcsrrr cagtcgctgg 540 

7G7G3GG7G7 GACGGTGGGG TCGTGGATAG G— ~ G ~— r -™~r— ^ 

Gv^^T.AT, G7GGGGGGGG 600 

-TGGGCGTA 7GTGGGG7GG ATGAGCGTGA CCGCGGGGCA GGCGGAGCTG ACCGCCGCCC " 660 

AGG7CGGGG7 7GG7GGGGGG G C GTAC GAGA GGGGGTATGG GG7GAGGG7G — — c ^ _ Q 
7GA7GGGG2A 3AACCG73C7 GAAG7GA7GA 7TG73A7AG G GAGGAAGC7G TTGGGGCAAA 

94C 



_ J 0 C 



1 C 2 0 

: o 3 o 



W O 99/42076 
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* * CCGC3GC - GGCTATGACA ACAG7GAA^ — 

^AG.^_ .^TGACAA GTTACAGGTA TTAGGTCCAG 

G7TCAACAAG GAGACAGGCA ACATGGGC7C ACGTT-ATr 

^L^TTTTATG ^G^ATCCGC ACGGGA7G 

GGACATGGCG GGCCGTlT^a, ^r^Trr-n^^ - 

TTTTG AGGTGCACGC CCAGACGGTG GAGGACGAGG CTCGCCGGAT 

OTGGGCGTCC GCGCAAAACA ^CGGTGO GGGCTGGAGT GGCATGGCCG AGGCGACCXC 

CCTAGACACC ATGGCCCAGA XC^TCACCC G^CGCAAC ATCGTGAACA TGCTGCACGG 

GGTGCGTGAC GGGCTGGTTC GCGACGCCAA CAACTACGAG CAGCAAGAGC AGGC^TCCCA 

.CAGAXCCC AGCAGCTAAC G.CACCCCC, 3CAGCACAAT ACTT7TACAA GCGAAGGAGA 

ACAGG--CGA TGACCATCAA rTATCAA"—- ~r-~ - t—„., 

--a.Uww._ -Xk^TGTCG ACGCTCACGG CGCCATGA7C 

■ ~- ~ ° — j GGGGG7TGC7 GGAG' i'"*^ ~ 
3GGAGTGAC7 TTTGGGGCGG C3CGG' — 



1680 

■ GCGATGCG 174 0 

1800 
1860 
1920 
1980 
2040 



^G — GACC 216 0 



--v^.^^ 0 ^-GGGTTCTT CGG 

2AGTGG7GG7 GGG 



CGGG 7GAGGACCTG 3AGGCCCAGG TAGCGGGGTG CT7CGA7 



^w„ Jv : AGCGAGCGGG ' ' ■ ■ 

ju^umGgt:; A"~~tgga~g~ — - 



2220 

2280 



ggggctgcc aggggttcat tacccagtt 
ggccgtaact tcgaggtga, ctacgagcag gggaagggcc acggggagaa ggtgcaggc 

GCGGGCAACA AGATGGCGGA AAGGGACAGG GCCG— - 

^G.TGGGC CTGACACCAG 2340 

GCCAAGGCGA GGGAGGTCG7 G TAG G AG 7 G A AG——- 

--^ vj i an.CrTTC GGG7GGCA 

G7AAGTGG7C AG7G 2TGGGG 7G77GG7GG7 — - 



2400 
246C 
2520 

7GA7GGAGG 2GCGGTCGGG : 5S0 



2-700 
2^6 0 



J77GGACGAG A777GGCGCC AGA7~~ G— 

- -^v^ JO _ G7GAACGCCA SCAGGTC3GT 

>-^GGGGG7G 7C3AGGTGC7 GGGC"* AC~~" — . — ™ , 

. _AGmG -GTCGAG7AC 
- - - w . a . . GGG CAACAA 2T'JA7*"C 

. ^ . . , AG7G . ^ ■ 

" * — 2GAGGAGGCC ^ - 

^ ° — ' --o^^ t^G7GA .7 GAG 'j'j — -• ^ ^ ^ ~ » „ , , ^ 



c s ^ 



WO W42(>~6 



pe r rs>><» oj2(iS 



122 



!3! TOPOLOGY: linear 
(X.) SEQUENCE DESCRIPTION: SEQ ZD NO: 107: 
Met val Asp ?he ^ Leu p „ prQ ^ ^ 



Asn Ser Ala Arg Met 



10 1S 



Tyr Ala Glv rl __ - ^ R - 

v*-y . Gly cer Ala Ser Leu Val AJi Al^ 5 i ~, 

20 1 ^ a Ala Ala Gin Met Trp 

2 s 0 



A^P Ser v.! Ma Ser Asp Leu Phe s „ ^ ^ ^ 

Val Val Trp Glv Leu -s- t 



35 ' ^ ^ a ^ a Ser Ala Phe Gin Se- 

4 5 



50 _ > er Tr ? Gly ser Ser Ala Gly 

55 bC 



Leu M* 3 '* Vt 1 - - i 

, Wta ^j_ d p - , r .. , , ^ 

^5 ^ - a - ~rr> Met Ser Va I Thr 

? - 30 



^ a G1 - Leu Thr Ala 

9 0 



a Gin Val Arg Val 



65 _ A - a Aid Ala 

95 

Ala Tyr Glu Thr A ■ i T 

100 - A L ^ T ;;; V ^ Pro Val Ile Ald 

110 

3lU Aiin ^ ^ a ^ ^u Me- He Le- - e Al. ^ * - 

1„5 u ~ " e -^r hsu Leu Leu Glv 

125 

3ln As;: Thr — - • - — 

^3C rt ^ a -vr G-lV Glu Met 

* J " 14 0 

Ala 1 1 - - ■ _ . - 

:50 ~ J '-* Aia Ala Thr Ala 

- - " ' ' Ala ?r- Glu m^- -t^- 

Jer Ala Gly v v Leu ^ 

130 ^ ^ a Alj Glu Ala Ser 



Ser Asp. 



WO 99'42(P6 



I'( T I '032()S 



J 24 



Ala Ala Ala Gin Ala Va n -v, 

d - Gi - 1 ^ °m ci y Va i ^ ai 3 

" 80 285 
Me, Ser S« Leu Gly Ser s , eu Qly ^ ^ ^ ^ ^ ^ ^ 

^ 300 
Vai M a Ma Asn Leu G} ^ Wa AU ^ ^ ^ ^ ^ ^ 

315 

Jlb 320 



Pro Gin ^ a Trp ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ 

J " :j 330 S 



335 



Ala Leu Pro Leu Thr Ser Leu Thr 



340 ^ Ala Ma G1 - ^5 Gly Pro Gly 

345 350 



~r. Met Leu 31;, aiy Leu Pro Vai 31y run Met 3 ]v »1, . ... 



3GC 363 

Til ^ U G1> ' ^ Ara 7a: p ~ Arg Pro Tyr Vol Mec 

' ^ 380 

Pro Has Ser Pro Ala Ala Giv 
385 390 ' 

INFORMATION FOR SEQ ID NO- 108 : 

: SEQUENCE CHARACTER: 3 — s ■ 

(A) LENGTH: 1725 blsp'pairs 
'3i TYPE; nucleic ac::i' 
- - 5 TRAND EDNE 33; S1:1 ^;„ 

,J *" J " : "^TTGATCTGC 3G7CAAGG7G c . 

GCGTGTC3CZ GGC3~GGA™~ - - - ~ 

CGAT 3CCGCTCTTT AG TG C AACT A ::c 
- '-'TTAATGTA GCGTTCACCC : : K 




WO 99/42076 
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C " KC3 - A CCACC ^ tac accscccwG 3 ~cc~~ 

^GGrCGCC GCC3CGAAGA TGTGGGACAG ^ 

GGCGTTTCAG TCGGTGGTCT GGGGTCTGAC GCTr^^ 

TCTGAC GCTGv^GTCG TGGATAGGTT CGTCGGCGGG 

TCTGATGGCG GCGGCGGCC7 Cr.rcGT^rT ar~r-^ 

TA.GT GG.G.GGATG AGCGTCACCG CGGGGCAGGC 

««™*c =ccccc,-,oo ™ GC „_ raMcAc ccrate 

rz™ Tc3cc=iMA cc=iac; °" ™ B ™ D!c TGAracMc 

^a^jjjicj C3CC=CC=A7 "jAGGCCAAT CAGGCCSCAT ACMCCAGAT 



AA 3ACGC jG A C n ^•^••^ m,-,^,- 

" — — A-GG CTAC3CCGCC ACGCCGGCGA r^CGACC; 

ort^...,.,^ GCCGGCGGGC — 



GGCC3 



CGCG GTCGAGGAGG GCATGGACAC GGC— ~ , w 

— ^w u ^ AACCAGTTGA T G AA C AATG Z 

TGCAACAGG ~GG~"~:ir~~ * 

-^AG_ .G.GCAGGGC GTCGTACCTT CTTCCAAGCT 

'^ CA TC7GTC ---- CTCAGCAACG TCAGTTCGAT 



CGGTGGGCTG 7GGACGGCGG TC 



AC^.CSA TGATGGGCAC GGG737G7C3 A7GACCAACA CC77GC^ 

GATGTTGAAG GGG™ AG^' 1 

j-'-j'-ou^^C TCAGGGGC 



ATGAG^TGGC TGGGCAGC 



GAAACCGCGG CGGAAAACGG 



- TGGG 



-^CGGGGGCGG T3C 

-GGCT^ACC AGCGCGGGGr* AAACGGCCCC GGGAGAG. 
--'rCRMATlGN ?0R SES 1U "0:139 



A — ^ a: i ^ u Pr „ 



720 
780 
340 

900 
960 
102C 
1080 
1x4 0 

L320 
138 0 

1500 



rCGCTGGGTT GTTCGGGTCT 15 6 0 



WO 99/42076 



K I1S99 .03268 



126 



Val VaI 
50 



r-T Gly Leu Thr v. I Glv Ser Trp Ile Gly Ser Ser ^ ^ 



55 60 



£u „ec Ma Ma Ma ,a Ser Pro Tyr Val U , Trp Met Ser VaI Thr 

° 80 

Ala Gly Gin nip T _ 

y i- iua .i. Leu ltu Aia Aia Gln val teg Ma ^ 

90 95 
Al- ^ G lu x*r Ma Tyr M- g Leu Thr Val Pro ?ro prQ ^ ^ ^ 

105 110 

Glu Asn Arg Thr Glu Leu Met Thr Leu T *r VLa -h>- a t 

115 ^ iitr Asn Leu Leu Glv 

120 l2S 

:3o rhr ?1U ^ a ^ e r:: A1 * - a ^ ^ ser ^ Me= 

: o 

■I'rp Gly Gin Asp Ala Glu Ala Met: - ^ 

^ 150 Thr Ala Ala 

160 

*hr A^a Thr Glu Ala l»u ^ p v n*-^ r>w «■> 

^ ^ U P ~° Phe Glu Asp Ala Pro Leu II, Thr 

170 175 " 

As" Pro Gly Glv Leu — - - - . 

180 ^ A ^ Va: Glu Glu Ala :i e 

135 lgc 



'.r A^a Ala Asn Gin ' m^»- ■ . - 

:35 pr - Gin ; u a Lev 



: ° 205 



^u Ala Gin 



^ Glv Val Val 



-o Ser Ser 1,-/3 Let 



- " ""■ ^-u Va^ - - ., 

" 3 233 :35 

■'al Ser Ser He A^ a -\cr " . v 



W() 99 42(T6 PC I I 



Ala Ala Arg Ala leu ?r:; leu Tar Ser Le 
Ala Pro Gly His Met: Leu Gly 



340 " ~- Thr Ser Ala Ala Gin Th: 

345 350 



355 



(2) INFORMATION FOR SEC; ID NO: 110: 



(i) SEQUENCE CHARACTERISTICS ■ 

(A) LENGTH; 3027 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNES5 : single 
ID) TOPOLOGY; linear 

XI ! 3ECUENCE DESCRIPTION: 3EQ ID NO;.-: 
AGTTCAGTCO AGAATGATAC TGACOCG"'"" „ _ 

- vjAgaca accgaa cgv - 

-GTCSGACGG GGGGACAT^G CAAG""CA~" — ~ 

o^CCGCCGAA GCCGAAGCGG 

- C G AA G 0 2 G A AGCGCTGGCC - — — - r ~^^..„._ 

CCGTGCCGCC CGGTTGAAGC 

jrtyuC3CT GGC3ATGGCC GGAGCGGAGG ACGAGAACGT GCGGG AGGAT ATCCAGACTG 

KirtnuACGC? GAAGACTATG A C G A C *" A ^ G A ----- 

-.--v ^u^.^ oCGGCAGACC AGGAGGCCG^ 



OA T. j 3CG G 01 G CAGTC G T C A T GAT T 
j^-^l,^. 1AGGCCAC0G AACG7C 

:accaaggtt gtcgaacagt gcaaag: 



-j ^ w . .. _visjv_.-aGCGGG GAGGGGA"] 
1ACCGAAGG0 AC3GTGAACG "GACAGCC 



24C 

IOC 
3 6D 



IGCGGTTA GOAAGACTGT GOACGATTGC 

" v --^~~ -TCAGCGGAT ACATTGTGTG 

j -j^-j - * ww^.. ^ >-^>GAGCCAA -\8C, 

-•T-fv^-fw-. i j _ ^-rtAAGAAG AC GTGG -"G - • 



'Wi^GGAA*G lACGG^GCA' 



96 



WO 99/420 7 (» 



I*C T I 03268 



128 

AGGTCGCACC TCGCCGGCnA — ^^^^ _ 

_C_TC 7A7ACGACCA G7TCACGCAG CAGATCGTGG 

C-CCGGCGGC CAAACAGAAG TCAC TGAAAA CCACCCCCAA GG7GG7GCGC GCGGCCGTGT 
CGCAGCTACA TCCGGATTCG GCCGTCGT.C TGGTTTTTGT CGACCAGAGC ACTACCAGTA 
AOGACAGCCC CAA7CCG7CG ATGGCGGCCA GCAGCGTGAT GGTGACCCTA OCCAAGG7CG 
ACGGCAA7TG OCGAVCACC AAGTTCACCC CGGTTTAGGT TGCCGTAGGC CCTCGCCAAC 
TCTGACGGGG GCGCGGGTGG CTGC7CG7GC GAGA7ACGGG CCG7-7C7CCG GACAA7CACG 

GCGGGACCTC AAACAGA rT, ~~ p^r- — r^-^ 

— CW_c_T CTAA7CGGCC GGGTTATTTA AGATTAGTTG 

CCACTGTATT TACCTGA7G7 7CAGA77377 CAGC7GGA- "AGCTTCGGG GCAGGGC3G.. 

.^TCTGGG G7TG7~ACTA GT73AGAGAA 7T73AGGTGT 7GCCGACG77 

G777GC737C CATCA77GG7 GCTAG77ATG GCGAGCG"A - r~ -t- — 

^.CGurt «(iin.TAIC3 AAG7GGTGGA 

-77C3GGGCG 7TACCACCGG «rj"~-., ,-. , 

AGA.^C.C L3CGAGGA7G TACGCTCGGCC CGGGTTCGGC 

_^^ 0 _CGC3 AGTGACCTGT TTTCGGCCGC 
GTCGGC37T7 CAG7GGGTGG TCTGG3GT™ GACGACGCGA 7C37GGA7AG G77CG7CGGC 

"""'"^ ^"GTGGCGTGC ATGAGCGTCA CCGCGGGGCA 
■ ■ ^ — . TG..TGCGGC3 GCCTACGAGA CGGCGTATGG 

^ GAC3CC ~ ------ tgcgtacggg ggcacggcgg cgacggcgac 



1260 

1320 

1380 

1440 

1500 

1620 

^690 

1V4 0 

1800 

I860 

1920 

1980 
204C 
21 CO 
2 15 0 
2220 
2280 
2 3 4 C 



2 4 : 



2 CAT' 



AGGA ACATCGTGTT 2 52C 
-~2A GCACCTT3CA J5GC 
-G2 2 2GGCGCAAAA 



WO 99'42(P6 



PC I ,rSM9 0326S 



2940 
3000 
3027 



129 

GACCAGCCTG ACCAGCGCCG CCCAAACCGP C C— r-~ 

- A.GCTGGGCG GGCTTACCGCT 2830 

GGGGCAACTG ACGAA^Arr^ 

WUG ' ' c — MGGTT AGCAATGCGT TGCSGATGCC 
■3CC3CGGGC5 TACGTAATGC ^rr~ir~ 

" GTU ^ CGCKCCGGG TAACGCCGAT CCGCACGCAA 
TGCGGGCCCT CTATGCGGGC AGCGATC 

(2 J INFORMATION FOR SEQ ID M0:1U. : 

(iJ SEQUENCE CHARACTERISTICS - 

(A J LENGTH: 3 96 ammo acids 
<3) TYPE: ammo acid 
(C) STRANDEDNESS : 

TOPOLOGY: linear 

^EQUKNC" DESCP.IPTION: 5EQ ^ . - — 

Vdi Vai Ala Leu Pro ^ n , 

* ^ " ° iU " e Asn 2er Ala Ar? Me- 

TO " 

lyr Ala GIv Pro g^ s^- «^ c 

" 20 " ^ " eu Val ^ a Ala Ala Lvs Me^ ^ 

25 30 

As? Ser Val -a 3er As, Leu P k 0 , - ^ 

35 " *; ^ d ^ a Ser Ala Phe Gl;: 3 er 

40 45 

Val Val Trp GIv Leu Th- - i , . 

SO ^ ^" : — Ser Ser Ala GIv 

6C 

Leu Mo" "ai • ' - - ' 

'-■ .-^a .Ua S»r --^ , , 

n.a .-p ve: Val Thr 



■3 0 



_ a 



^a v a _; 
9C 



IGfj .... - J - * - -i- Aia 



^- -i ; . U 
140 



WO 99 42(T6 



PC I IS99 .03268 
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195 205 



Gin Gin Leu Ala Gin 



Ly = SGr Ile T -P Phe Asp Gir, Leu 

" b 220 



Ser Glu Leu Trp Lys Ala He Ser Pro His Leu Ser 



225 230 ^ nis Se r Pro Leu Ser Asn 



235 



240 

He Val Ser Met Leu Asn Asa His Val Ser Mer Th. a* c 

^ jer Mec Thr Asn Ser Ulv Val 

250 25 : 

Ser Met Ala Ser Thr Leu His ,er Met Leu Lys Glv 



2S0 



265 



Ly Phe Ala Pre Ala 



270 



Ala Ala Gin A, a Val 31u Thr AIa Ala «» As, 3 ; y Val Gla Aia Mec 



— Ser Leu Gly Ser 



29C ., c " ° er 3er L - u 31y Ser s»r '31 v „eu 

300 

^ ^ ^ ^ ^ A:a Ala Se, Val Gly Ser 
Jx3 320 

Leu Ser Val Pro Glr. A' a -~ 

J "~ a ^ a ^ a Gla Ala Val Thr °ro 

33D 335 
^ ? ^ ^ - ^ ^ Ala Ala Gl, Thr 



345 



350 



Pre Gly y e: - Leu 

355 



~eu ?r:; L-u Gly Glr: Leu Thr As:: 



" r 3-lv ?h- V - • 

3 - j * " ;/y ' rtC;: Arg yet ?rc 



Arg Aia r- r y a ; 

^ 390 

::;f':?mat::m ?gr ge; :r :; 



* - 7 '-'^ ?r o Ala Alc» G1-. 

3?5 



:-..:jk:;gv g:iaragt::k ; 
^ -e:;ctk . : t ... . 

TYPE; nuclei- Jc:;: 
■ 1 l-P-LCGY l::iea: 



WO 99 42076 



PCI 7l'S99.'U32ftS 



raTaACi '' ^tcco^ CMOCatCK! 

mKmC ' ****** °— cc M 

CACCCAGGCC GTCCTTACGG CCACCAAC"- n-™-™ 

CTTCGGTATC AACACGATCC CGATCGC3TT 
«CC= Kai= =AraiTI „ TCCGTATGTG MM gccct=ocaa 

^ OTGC3CTTA — , CGTC=STC ^ 

™ TO C 0C „ GCACGACGAA CCCGATCTTC ^ rQCCCr 

CTCAACACCG GTTGGCCAG"" — 

^ ._ C .GGC GGCTACCCAG ACCCTCGCCC AACTGGGTGA 

3ATGAGCGGC -CGATGCAGC AGGTGACCCA 

„ ^v^i^^.^; CGTTGTT'CAG 

~ W *""~~ 3 -S-AACCC AGCCGACGAG GAAGCCGCGC AGATGGGCGT 

AGTCCGCTGT CGAA C -AT"C GCTGGCTGGT GGATCAGGCC CCAGCGCGGG 

— CTGCGCGCGG AGTGGCTACC TGGCGCAGGT GGGTCGTTGA CCCGCACGCC 

_ " GCTGATCG ■" AAGCC3GT T — — — CCGG CGGC7GCTGC 

-^^CGCTGG GGTGGGTGGG GGAGCGATGG GCCAGGGTG" 

— — GG-T GGTGGCGGGG GGACGGCTCG GG G AG G AG CG 

GAC 3ACGAGGAGG AC""GG~A~~ " - 

— — M^^JGAG 7GGTGAGC7G GGG7AA7GAC 

^ — ow^ - CGAAC AT777GGCGA GG A AGG7 AAA 

AG7CCAGGA7 1GGAGAGA7G AAGAG7GA7G C: 



jC7ACCC7 ggcgc 

-jaGljGAT ~7GGGG'"" _ * 



-^ZZZAC G7GAAAAGCG AGATGGACGA GG7GGAG7; 



AGGAAAAGAA A G G G A G CAAA AACA7GAGA :- 



— vG 0GAAA7 



::-gpmag:g:; fgp. geg :r . N ,. 



300 
360 
420 
480 
540 
600 
660 



84 0 

90 0 
96 0 
10Zr, 
103 0 



1320 



" " 1 ' A GGGA n - 

'GGAGG OGGCGG GA 



WO 99/42076 

VCY.VSWOMbH 

132 

<*i> SEQUENCE DESCRIPTION: SEQ ID NO:H3. 

CTAGTGGATG GGACCATGGC CA-^T^r— 

^'^^^^ AGTCTCACTG c^^rrr-T. 

tcc3ct=c;:;i taicotcot 

A^C^ CCTTC3TGCG GCCC^ C CTTGTCGTAG ^CGCC 

TCAGAGTGCG CTCAAACGTA TAAACArrar „ , * 

TAAACACGAG AAAGGGCGAG ACCGACGGAA GGTCGAACTC 

GCCC3ATCCC GTGTTTCGCT ATTCTACT^" izr-rr-r- 

A..CTAC.CG AACTCGGCGT TGCCCTATGC GAACATCCCA 

STGACGTTGC CTTCGGTCGA AGCCATrr- r.r. 

AC.JYTTGC .GACCGGCTT CGCTGATCGT CCGCGCCAGG 

TTCTGCAGC3 CGTT3TTCAG CTCGGTAGC ~r-~ ~- 

v.- v/TW^CC ATTTTTGCTG GACACCCTGG 



T.VrORMATIO;; FCR 5EQ ID MO:1 , 4; 

I-) SK2'JSfJCS characteristics. 

(A) LENGTH: 363 amino acids 

*iPa: amino acid 

TOPOLOGY; linear 



^-"* £ '--^«: ic^ 10 NO ; 114; 

Me- Leu ~ — ^ - - 

- • Me: ; -j — 

1 5 "* w AQd ^ Thr ^ Arg Le- Me- 

13 

lb 

A ^ a G 1 v A i .a j ' - - rj *- ^ ^ 

:: — A.a A. a .Ala Gly H r / 

" " 3 CJ 

3 Y ^ J ^ A ^ Thr A.a Ar, 

^ 45 

Le- Asn 3<ei v,. . - - . , 



60 
120 
180 
240 

300 
36C 
420 
432 



j.:. A^d A: a Ala 7', 



WO 99/42076 



1'C T/r.S99 032<)X 



Asr. Gin Ala Ala Leu Ala y-t- r i„ ,. , ^ 

145 fc" GaJ Va ~ ^ Glr " ^ Thr Aia Val 

155 160 
Thr Leu Phe 31u Lys Leu Glu ?ro Met ^ ^ ^ 

WU 175 

«y «. s. r „„ s „ Ihr Iht to n> pte G _ y ^ s e . p „ 

185 190 

Gly Ser Ser Thr P^o 1 n., /-i 

P-o Va_ Gl/ G.n Leu Pro Pro Ala Ala Thr Gin Thr 

200 205 



Leu Gly Gin leu Gly Glu 



Met Ser Gly ? ro Mec Gl; 
215 220 



Pro leu Gl:; Glr. Val Thr ^ - ^- ~ 

22S - ^ ^ Val G!y Gly Thr Gly 

240 

^' ^< ^ «» Mec 32;, Leu 31,- 

250 ' 



Thr ser Pro Leu s-r Asr. Hi3 Pr3 Leu ^ 



SO , c ■ "" la Gly G:y Ser G: '/ Pro Ser 

- 5 " 273 



■^a 31;, :. s < 



<! - eU ^~ Ala .ser Leu Pro 

2 3 0 

S=r Leu Thr Arc Thr ?r 



2~5 ' - ro Aj.a Giv Gl-.- 

~ 30 285 



leu Met 3^r G"" T pi- t- ^ ^ ^- 
^' ^ " e - — ->'s Pro Val 

ioa 

.. v ^ a ;;v Se 



3C^ ^ ^ a *er j sr Ala Thr Gl y 

31 y a; t Ala ?r: g; v ■ 



^ 32, 



_ ^ ■ .-i-a oer 

J J " 3 3 5 

- i r Thr Ar- . . . . . , 

34C ' "•" - a ? " ^ -eu Al, 31:: 

3 5 ~ 

- ; - - .\/.;: -I-.-,-- 

' ' ; ' : Trr. 



^ TV?" 1C1J . 



WO y9/420"6 



PC;"I7i;SMt)/0J2(»8 



Asn Phe Glu Arg H e Se- m*, « 

^ ^e. (j ^y /U;d Leu Lvs Thr ^ 

20 * ^ q 7 " rn ~ Asp Gin Val 

30 

~ ! " r - - - - - «, «, «. «, wr 

Tlr » Ma "* - - - - «. oi» 1 ^ ta Lys 

SO 

Gin Lys Gin Glu Leu Asp Glu lie Se>- ^ _ r7 a 

63 70 U Ile ^ Gln Ala Gly 

75 n 

ao 

Val Gl;; Tvr s^- 

^4 A* a Asd Glu G 1 " ^^r. ^ 

35 * VJXn vjxn Gln ^ Ser Ser 

90 95 



^-formation for shq :d N0:l1 ,. 



SEQUENCE CHARACTERISTICS ■ 

W TYPE : nucleic acid 
tC> S7RANDEDNESS: single 
(2) TOPOLOGY: linear 



^CotNCS 0E3C? ^~ — ^ 

. ai g NQ . 

— ^'rt.. Jn -^-AGGTGGAG 
j^j-jCCAGTGG I jCGC" 0^3^ ^ 

• "~ -EGGTGGT3G OOTTCCAAGA 

,JJ ^ J " ^ ^ " "~^ G AAG G AGGAACT"' ; >i 



AATACJAAAA 3AAA::-lA-r_- AAAAACA73A 



* - -»TCAGGCCC 
. '-»AAT GGGC7 



✓ L ^ ~ j HA;-/. C T E E " 

LENGTH: i? C am: 
I ~YPE : arp.mc a; 
- ; T?A::i;-E:jrjEGG 



7 G0 



WO l J9.'4207ft 



PC T I V^I 03268 



Ij3 



Ser ,eu « a 3 : y aln Tep 0Jy Ma ^ 31y Thr ^ ^ ^ 

Ala Ala Val Val Arc 



35 



Phe Gin Glu Ala Ala Asa Lvs Gin 
40 



45 



Leu Asp Glu He Ser 



50 



Thr Asn lie Arg Gin Ala Gly Val 



55 



60 



L >- Gin Glu 
Gin Tyr Ser 



Arg Ma Asp Glu Glu Gin Gin Gin Ala Leu 



65 



70 



Ser Ser Gin Met Glv Phe 
75 



INFORMATICS ?C R 3EQ ID N0:I1£ 



(3 
(C 
(C 



^ : 3 8" ba< 



e pair;; 



nucleic acid 
^HANDEDNESS: smgl 



TOPOLOGY; linear 
ZQUV;CE DESCRIPTION: SEC ID NO: 118; 

i — ^ * CI AC3CGAACTC GGCGTTGCCC 
* ° ^ " " " w ^ - 1 - '■ ^AAGCCA , ,^^ F1 



*rtTGCGAACA 
^i_:-;-C'JCTG ATCGTCCGCG 



- j-TGC GAGOTT; 



110 
180 




WO 99/4:0"() 



PC I I S99 '0326N 



136 



GGCGGGGGTT ^CCGATTGG CAT— Gcc ^ 
(2J ^FORMATION FOR S £Q r D NC:i20: 

(1) SEQUENCE CHARACTERIST— S • 

JAJ LENGTH: 20 amino *acids 

^ TYPE : a,;no ac ld 

fC) STRANDEDNESS 

fE» TOPOLOGY: linear 

(xi) DESCRIPTION: SE Q ID N0;120; 

;r — 3 ^ ^ -iy GIr. Val 



7 al Ala Aid Leu 



"FORMATION FOR 5 HQ : D : J0 121 : 
SEQUENCE CHARACTER IS — CS ■ 

(a: length.. 15 amirio ^ 

TYPE: amino ac^ 



5 

in 

II 



STRANDEDNESS ■ 

^ TOPOLOGY: l^ea 

seqltlvce 0E5cr:?t:— 



~v ~o : 

A- a Leu ^ i 



"~ Pro Sp^ 



- E CHARA CTER : ~ 7 ; j - . 
3 7RALT: ED ME S S . 



UO 9 1 >/42(T<> 



137 



iXi; SEQUENCE DESC^-d— nu 

— P-*ON: 3EQ ID NO; 123: 

Tyr Tyr Trp c ? 

1 - ro Phe Pro AIa 

5 t> AJ.a Trp Glv Pro 

10 

INFORMATION FOR SEQ ID NO: 124 : 

fl) SE 0UENCS CHARACTERISTICS ■ 

UJ 14 amino acids 

<B> TYPE.- am.no acad 
fCJ ST3ANDEDNESS • 
™ TOPOLOGY : u near 

;x:! SEQUENCE DESC^n— P m 

He Gly s e - - 

5 Gia Ala 7 d I 

FORMATION FOP GEO mo 

*' Jc v NO: 12b: 



SEQUENCE CHAP^CTERIS^ICS ■ 
(A) MJGTH: 13 an.no acids 
'Bl TYPE : anu^ acid 
fCI STRANDEDNESS • 

TOPOLOGY: ; lnear 

SEQUENCE :E3r-^3— v — 

SEQ T- NO : 12 5: 



Ser v-, . 

"~ ' <a * ^ - -'^a lie val ? r - 



-L-ORMATICN ?CP. 3E~ — 



sequence EiiA?^:--^-.-^ 

* * - - • am^rio acid 
■ - ■ 3 72ANTJ EE NES S : 

0 ] cnr --, „ 



WO t ) , ; , 420*'6 



Ui) SEQUENCE DESCRIPTION: SEQ ID NO: 12 7 ; 



Ala Pro Lys Thr Tyr Xaa Glu Giu Leu Lys Gly Thr Asp Thr Gly 

3 

(2) INFORMATION FOR SEQ ID NO: 12 8: 



5 10 15 



(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 ammo acids 

(B) TYPE: amino acid 

(C) STRAND ED NESS : 

(D) TOPOLOGY: linear 

(XI; SEQUENCE DESCRIPTION: SEQ ID NO: 128: 
Asp Pro Ala Ser Ala ?r-> v., i - n - - < 



be: Lou 

20 



a As p ? i" : 



Val Ser Phe Ala Asn 

30 



INFORMATION FOR SEQ ID NO; 12 9 



SEQUENCE CHARACTERISTICS, 
i A; LENGTH: 22 amino acids 
(3) TYPE: amino acid 
(C: STRANDEDNES3 : 
:Z} TOPOLOGY: linear 

SEQUENCE DESCRIPTION- 



31 v 



23 

information for sec id :,t 




DESCR IPC 



WO W42(P6 



PC I L S99 03268 
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(3) TYPE: amino acid 

(C) STRAND EDN^SS : 

(D) TOPOLOGY: linear 



fix) FEATURE : 
Pro or Thr 



(O) O^KH INFORMATION: /note- "The Second Residue Can Be Either 



(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 

Xaa Xaa Gly Phe Thr Gly Pro Gin Phe Tyr 
1 5 10 

^-FORMATION :GR SEQ ID NO; 132: 

.ii SEQUENCE CHARACTERISTICS : 



- * ammo acids 
ammo acid 



: '3) TYPF: 
C) STRANDHDN2SS : 
(0) TOPOLOGY: l iaear 

'EATURE : 

ID) OTHER INFORMATION: 



3m :r Leu" " " The - — rd Residue Can 3e Either 

SEQUENCE DESCRIPTION: SE^ ID NC : 132 ■ 
Xaa PrC x ™ -la T^ Ala Gly 

: information ?or sec id :;o:-, 

■■3) TYPE: amine acid 

:C: STRAi REDNESS 

1 ij 1 - ^ fO Lw w Y : - mea r 



iEQITNCE ."-iAPACTERI OTICS 
^A, ^ENOTH : is amine ac ' 
■:3 4 TYPE- ammo acid 



WO 99/42(T6 



l»t J I 0326S 



140 



) INFORMATION FOR SEQ ID NO:I35 : 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 15 ammo acids 
(3) TYPE; ammo acid 

(C) STRAND ED NESS : 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135 : 

Ala Gly Asp Thr Xaa He Tvr t X « Val rlv a _ T 

1 s * i - /ai G1 y Ann Leu Thr Ala Asd 

INFORMATION FOR SEQ ID ^0:136": 



LENGTH: 15 am:r.3 acids 
^; TYPE: amino acid 
(C; STRANDEDNESS : 
^ TOPOLOGY: linear 

(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 



Ala Pro , ^ 



Gly Gly Thr Val Gin Ala Glv 
10 



INFORMATION EC?. SEQ ID :; C . — : 

i SEQUENCE CHARACTERISTICS 

A: LENGTH : : i anuncTLrids 
3: TYPE: ammo acid 



^QUENCE IESCR T T ■' v »t . - ^ ~ - ^ v ^ 
— He Ala "\'r xaa Thr ' 



Ala Gly /al r>^ ~i 




WO 99 42076 

PC17l ; SW03268 
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GGAACGGTG7' "G" GO—;' -J^;;— - - ~ ^ . . 

tcaccatccg aggga—— g gggtcaaaa;; gggtagaggg acaccaaaac gggtagcgag 

GGAAGTTCAT GCGGTTG: 



GCGAGGCAAC AGGAGGCGGG GGTCCCGCCG CGT^^GCr-G 
ATGATCCCAC CGCTGGATTG CAGGGCGGCA CCATTCCCCC TGTACAGAAC G7GGTGCCGG 
GGCCGGGTAC CTCACCGGGG OTGGGTGGGA CGCCGGCTTC GGCTGCGCGG GAAGCGCCGG 
CCGTGCCCGG TGTTGTGC-r GCCCCGGI'GC CAATCCCGGT CCCGATCATC ATTCCCGCGT 

A :: CO CCGAC j A rGC C'GGTGA 



TCCGGG3TTG GCAGCC7GGA AT.: ' ' CGAC 



CCAGGTCGGG L'ACGAGGC" ™A™;'G~ -.-^CGAC" — - 

^ ACJ; — - -gccaac:;ac ggogcggaco acgcgggtga 

" MwXA! ' ; AACGAGCGTG GGOOG GACGA :''GGr-GC-:\: GAGGACGGTG GGTCGGAGGA 
: ;V :J:: G AC GACGGTG GCTCCAGCCA 0'— OA - ; - ;/, .; ;aog*- ; GCTC 0 0CAGC 
C G A C G C A G GGCCACGGAA GAAGCAAGCC AACAGATG:- AACCCAGCAG CAGAGGGTGG 
- — GCTCGGGAG T CGCCGTCCG G 7GGGC3CAAC GGCAGCSGCG 

, '" , " , " T ' — --GGG TTCTGACCAO GG7CGCGGC7 :^-rA:77r G ; ;AJ7ACA0' 

07 :::r :; . :: : . 

I ■ 0 OOOGFN* OHAkACOK o : 07: . 

- 3 - : T'Yr: r : :n : " L e i r toiO 
:G: G 7 0 A NT r \ 0/ f J K 0 f ^ : 



120 
130 

30C 
3SU 
■12 0 



6 00 
66 0 
^ P 



WO W42(T6 



PC T/l : S99 .1)3268 



u; 



GGAGTarcGO ctjaccgg"" gaaggagcgg ,-, A . ;rA - r ...... .,-^ c . c „„. f , 

GAGTTGGAGA CGA.ACGACA GCACGGGGTT GTC0GC33AC 7GGGTGAAAT G ; '.".GGCGAT 
AGCGGCTAGG TGTCGGCTGC GGTCAACCTC GATCATGATG TCGAGGTGAC CGTGACCGCG 
CCCCCCGAAG GAGGCGCTGA ACTCGGCG7T CAGGCGATGG GGGATCGGTT GGGGGAGTGC 
CCAGGCCAAT ACGGGGATAC CGCKJTGTCWA AGGGGGCGCG AGGGCAGCTT CGGTTGCGCG 
AC^GTGGTGG GGG~GGCGrG I'A J GG'CGTT GTG^TCGAAG A C G A G T A G ~ A ' ; ;~" ;;• 

: ^format icr; pop pep :p :-jo:ho : 
gpo;iei;pe c:;ARAr-H;.:i pt:op : 

(A: PE::GTK: : : ^ Pas- :.. llir :; 

TYPE : ^rid 
C; GTRA^DEPNEOP : s i nq 1 o 
(1^ TOPOLOGY: ] iuear 

< — :-:0LF-CjLI- TYPE: .c:,:!:^:: # 

x : gegp;ppp-pg peocpipt: gp ; p : - :: - . . 4 

J "" /A/;ATT ^ ATA - .;::p.;-: gggotaacao 

;ggggogtga :?;tpppa ; — 

'• • : ! " - * :! " :, ' V,M '"- APjTAT'"." pgagapggtg 



480 
540 
GOO 
6C0 
720 
8 0 
SIS 



™»0/42076 PC , LS9,(,^8 

14^ 



TTCRRACAGC AGGA7TGAGG AGG3GACGGA ACGTGT'JGTG -;; ;;AATT-; : rA<GJAGC"— 
--TGAAACCG CTGGCCGGGT GCTCAGTGCG CGTAGGTAAG c:\GG7— v- - -AG ^CCGGGG 
GGCCGGGCGA AT AC GAG C AG ATGGG AC AG G GAATTGCCGC CGAGCGGGT- GGAGCCGTGG 
ATACCGCC3G CACACTCACC GGCAGCGAAC AGG:CTGGCA CCGTGGCGGC GCCGGTGTCC 
GCGTCTAGTT CGACACCGCC CATCACGTAG TGACACGTCG GCCCGACTTG C,\T—(:CTGC 

gttcggcacg ag 

::;fgrmat:cg: for 3eg 



■i 5E0UKIJC:-: GHARACTHi^ I iTI' I. GG : 

i A! LKNGT'-i : >• :; base pairs 
: !■ ' GYP- : taici- : ,-; aci.i 
;C: 5 7RAi;GRnGE;G~ - ; ; . • 

TOPOLOGY: : mea: 

[11/ MOIGGGI.K GVIG: :;r;A (gp : i:,T lc ;. 

Ixi; S IvOGJENCG DESCRIPTION: GRQ TT GO: 141: 

rccGGA ttcggcaggg tgta-ggcg ;g:- i;- g-a:; gggggaggag tgoc;acgag 



;GAGT CCGCCOTGGG OGGOTCTTTG OTTGGGGAGG GT"A T~'~"G A-" GTGT ITCCGO 



3d0 

102 : 

7 080 
114C 



WO 99/42076 



PC 11 S99 03268 



144 



:;t- ■ 



Asn Ala Vai Vai Ala Phe Aid Veil II 



Gly ?he Ala Sor Leu Ala Vai 



10 15 



Ala Vai Ala Vai Thr He Arg Pro Thr Al 



20 25 



ri Ala Ser Lys Pro Vai Glu 
JO 



Hly His Gin Asn Ala Gin Pro Gly Ly 3 Phe Met. Pro I.eu L,u P-o Th^ 
''-in Gin Ala Pro Vai Pro Pro Pro Pro P:o Aoo Aod Pro T"r Al * 



Gly Ph- -In Gly ;ly T: 



a V:il r.. ; - v Vll pr , : 



^ - ::r Pr :■ Jlv Vi 1 ... . 

' ' — ^ - oei ^ j ro A . -\ p>-^ 

Ala Pic Al., Vai Pr, Gly Vai Vai Pro Ala Pro Vai Pro t < p,„ 

110 ~ ' 



■Oil Pro II. 



:t ~"- Pr ' Phe Pro 



12 0 



'/ Trp Gin Pro Gly Met: P — 
125 



^r.r Ho Pro Tor Ala P; . p : 



" '-'^i Thr Thr oer Al ^ T- 

ro" Pro Pro Tor Thr P: y. , - r u,- .■„ 



'-i 1 Thr Thr Pro Pro ohr 



Lv5 



1 6 0 



130 



Th: 



I'hr Thr 



WO 99/42076 
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;i ; SEQUENCE OHARAOTEP TSTICS : 

( A ) LENGTH : i 7 4 a mi no acid g 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

in) MOLECULE TYPE: peptide 

ixii SEQUENCE DESCRIPTION: 5EQ ID NO : 1 4 3 : 

Ilo Ann Gin P:-o Leu Ala P::j Pi;> A] a Pro p-n a^o Pro p- 

j ; ^ 

Pro Ara Pro Pro Val Pro Pro Va ; Pr- v : . } : Jf . u j )rri : > r; , s . 



:o Th. 

8f; 



WO ')') 42076 



rCT l'S^ 03268 



146 



11:: Pre; Ala Glu Yu : Pgr A:;p ;.A:i Ar:j YaP .:or Civ Lou Thr Glv 

1 10 15 

Ala Val 'Jin Pro Sor Pro Ar^ Ihr Thr Ala Glu Acp Pro Arq Pro Arg 
20 2b 30 

Asr: Arg Arg 
3^ 

TTIFOPMATTOr: E P SEQ TP) EG): 145: 

: gep;gk:;p , e p } i a p a g t e ? i p t t ■ ' \p 

■,A ; EENGTH : 104 a^:r.o ^:nr; 
H } TYPE : aininu au l :i 

GTEAPJEEDNESP : Jii:gi- 
P ' T'^PP LOGY : \ : 

YPLEG"GPE TYPE: p-pv.;.de 

■,x: 1 PEOEEUPE PEPCE I PT 10" : PEG ID tPO : 1 4 P : 

Arg Ala A.gp S-.-r Ala Gly Gyg Thr Pyg Ar;; Trp Cyr; Xda Pro His Glu 
: 5 — 13 . 

Gyu Arg Ar'.j Pi- Ala Mot; Arg Glu Gl:; Kir, Gly Por Arg Ser Th: Thr 

::o p- 30 

: r '" ro ' :: -l pr - Ar 3 Ar • A I i Arg Y^l Arg Pre Glv Arg Ecu 

45 

Ph " ^ ro Ala -'±r A-o A-p Ya L I>ho Pr-; Pro Trp Pho Aia Aid 



1 ■ Ar/: A: ; '■' 1 - A: i ■■ o : Yal Trp Pro >laa Yu 1 Arr 

;:; 80 

; " Arp Asp Thr Gly y. ; Pyg P-g P:>. : Glu Arg Arg Ala 



WO 99/42()7 6 PCT rS99/0326S 



147 



(Xi j SEQUENCE DESCRIPTION: SEQ ID NO: 14b: 

GGATCCATAT GGGCOAT J AT GAT CAT CATC ACGTGATCGA CATCATCCGG AC 

(2) INFORMATION FOR SEQ IP NO: 14 7: 

( i } OEQUENCE CHARACTERISTICS : 
(A) LENGTH r 42 base pairs 
(3) TYPE nucleic acid 
;C) STRANHEONESC : single 
:nj TOPOLOGC: .Unear 

•ii> MOLECULE TYPE: orhor nucleic acid 
■vi OPIGINAL COULCE: 

• CC^GAATTCA GGCCTCGCTT 



MAT ION EOF. GEO 10 NO: 14": 

SEQUENCE CHALACTEPICTICS : 
(A, LENGTH: 11 ba.;.-^ pairs 
:E/ TYPE: nucleic: acid 
'C: ETRANOEPNESG : 3inqlc 
::='! TOPOLOGY: ; :;:-:ur 



WO W42076 



US 



GTCTGAATTC AGCGCTGGAA ATCGTCGCGA T 

(2) INFORMATION FOR SEQ ID NO: 150: 

Ci) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3 3 tiacc pair:; 
■3) TYPE: nucleic acid 
(GO 7T!A::OEDNESS : cinq!.- 
TOPOLOGY: linear 

. : : ) E:-'LL : :G-E. other ^ > 



. X 1 _> 1L L,' E N O ; ■ . 0 E 5* O p 7 PT I> "> ?-J ; 
G AT 0 C A G CG C T G A 3 A T G AA C ; A O O " A T 7 



. 3 3 Dd. 



' G r 0 L C i G Y : line ;-i r 



PC T/LS99 03268 



3 7 



WO 99/42076 



P( T l ; S99 (M2(>8 



14 l > 



uG NAME /KEY: Cw,: 

uP; LOCATION : I^u 3 

(xi) SEQUENCE DESCRIPTION: GEQ ID NO; 152. 

TGTTGTTGGA CGGCAGGCTG GTGGAGGAAG GGCCCACCGA ACAGCTGTTC TCCTCGCCGA 

AGCATGCGGA AACGGGCCGA TACGTCGCCG GACTGTCGGG GGACGTCAAG GACGCCAAGC 

G C G G AAATT G AAGAGCACAG AAAG GTATGG C GTG AAA ATT CGT TTG CAT ACG 

Val Lyr, He Ari Leu His Thr 



GTG T'l- GCO GTG TTG ACG P ' '.' ; ; ; OCG GTG GTG CTA G :A GCG GGG GGC 
Leu P-u M i Val Leu Thr Al * Ala >c Lou Leu Leu Ala A] a Ala Glv 



60 
1 20 
112 



- — - 1 - - — ■ • ..-i -^A.-v ACC G ; ' ' GGG GGC j, 0 

Cy ^ Jl >' — - l 'V> ]J -" : ' --7 Lr- Glu T:n G.v Al a Glv 

J£ 

»~ T GTG GCG ACT ACG GCV GPP TCG TCP C'CG GTG ACG TTG GCG GAG 31 £ 

Gly Thr Val Ala .h: Th: Pre Ala P^r Ser Pre Gal Thr Leu Ala Glu 
40 ^ <->C b5 

ACC -™ AGC ACG — 3 CT:: ~- ; GTG TT- AAG GTG TGG GGT GCG GGC 

Thr Glv Cer Thr Leu Leu Ty. Pi.; Leu Phe A:;: L-u Trp Gly Pro Ala 



PL- Clua Au : Tvr Pre Ar.u G. t ; Tnr He Tin a, a Glu Glv H * ; 1 • 



Ta 1 A jr. : 1 e J ; y 

:co 

"A" A A ■ 1G 



NVO<>94:(r6 KT.TSW/M26S 



GAC GGG TCC GAG GGG TCC GG T GAC ACC TTC TTG TTC ACC CAG TAG CTG 
Hi-, Arg Ser Asp Gly Ser Gly Asp Thr Phe Leu Phe Thr Gin Tvr Leu 

190 195 

TCC AAG CAA GAT CCC GAG GGG TGG GGC AAG TCG CCC GGC TTC GGC ACC 
>r Lys Gin Asp Pro Glu Gly Trp Gly Lyn Sor Pro Civ Pho Gly Thr 

210 215 

' gcg ctg ggt gag aac cgc aac 

Ala L,iu Gly Glu Acn Gly Asn 
aja 'cg :;gc tgc gtg GCG tat 



200 205 

'"hr Val Asp Phe Pro Ala 7a 1 Pro 
22 0 

lgg ggc atg gtg acg ggt ?g G2c 



: 3 o 



1 ■ e 3er Phe L-u Asp G 

2 SO 



GGA GTG GGG GAG 
Gly Leu Gly Glu 



- J=> 2 6 0 



GCG CAA CTA GGC AAT AGC TGT GGC AAT TTG TTG TTG CG^ GAG GCG CAA 
Ala Gin Leu Gly Asn Ser Ger Gly Aju Phe L-u L-: ?n Anp Ala Gin 



In AI :» Al a Ala Al a 



;cg A A ~ 
.la Asa 

29i3 



1 ^ 2 2 t -> T 



796 



WO 99/42<T6 



I'C I /I S99/0J268 



. gatggg tgggatgago cpcaagggpt tgcatttctt caccgogacg gaatggaatc : s i i 



OAGGCAAOAO CTACGGCGAA ACOGTTGTGA CCGACGCGTC CCCCATCCG3 TCGGCGCCTA 
CTACGGGGCG TTGCCGCTGA TCGTCGGGAC GCTGGCGACC TCGGCAATCG CCCTGATCAT 
CGC G3TGCCG GTOTCTGTAG GAGOGGCGOT GGTGA'i'GGTG GAACGGCTGC CCAAACGGTT 



w.) INFORMATION POP CKQ ID NO : 1 r -> 3 : 

UP spquengp charagtlri otics : 

(A) LENGTH: 3 7-1 a:-;n: acii 

TYPE: amino acid 
fD) TOPOLOGY: linear 

, i : . MJ-LEGULS TYPP: pr o i:; 

{:■::; SPgULNCP OPPGRJ P7 ICG : ~ „. IP 

'•' 1 ' : V- : — •-' ArG L-r.i :. : .: Thr P-v; P-"; A". ; 

P"i; AO\ ;-P i ;P;i g; ■.■ G '.' P- * 



L 5 3 



3C 



1 S7 ^ 
1633 
1653 



GGCGGAGGGT GTGGGAATAG TGGTGGAATT GCTCGOGGGA ATCCCCAGCG TGGTCGTCG3 17^3 



i3i: 



P30T CCCGAT3TG.: OOGTO^TGAA PTA' "PTPGG .; PGPPAGPCGG GGAACGGGGA 1373 

TGGTGTTGG" TT:\":ATGOT ; yrTGOPATIA TGGCCACCAC 1 0 3 3- ' 



WO 99/42076 



PCT/l'S*)') 03268 



13 C 



1 1 S 



14 ■ 



Leu Asn Gly Lys Val Leu Ala Ala Met Tyr G 1 r. Gly Thr lie Lys Thr 

145 150 ' 15 5 " " iso 

Trp Asp Asp Pro Gin He Ala Ala Lea Ami Pro Gly Val Asn Leu Pro 

165 170 175 

Gly Thr Ala Val Val Pro Leu His Arg Ser Asp Gly Ser Gly Asp Thr 

-GO las 19 q 

Phe Leu The Thr Gin Trr L-u Ger Lys Glu As;: Pru G.I u Gly Trp Gly 

20 0 2 05 

Lys Ger Pro Gly The Gly Thr Thr 7a i Asp Phe Pro Ala 7a 1 Pro Gly 



::1 ' V - v A -" r - A — ^7 •'■■^ '-'a : Thr G;.y Gys Ala Gl u 



i: A - a ~'y: He Gly He Ger ?he Leu Asp Gin Ala 



2 SO 



255 



"- r ' : " Ar 'J ' J1 V : '- u ;j: V Glu Ala si:: Leu Gly Asn Lor Ser Gr^Asn 

2 b 0 2 (73 270 

A:: ? A - 1 12'- 1 31:; Ala Ala A. a A±a Gly Phe 

— 260 2H5 

L - a ''" r - J l- V ' — ^ : - Ala lie Ger 11^ Asp. Gl'/ Pro 

2*3 3 2U 



,sp 



! 3 5 



153 

TGTTCTTCGA C GG C AG GCTG GTGGAGGAAG GGCCCACCGA ACAGCTGTTG TCCTCGCCGA 
AGCATGCGGA AACCGCCCGA TACGTCGCCG GACTGTCGGG GGACGTCAAG GACGCCAAGr 
GCGGAAATTG AAGAGCACAG AAAGGTATGC CGTGAAAATT CGTTT3CATA CGCTG7TG1C 
CGTGTTGACC GGTGGGCCGC TGCTGCTAGC AGCGGCGGGG TGTGGOTCGA AACCACCGAG 
GGGTTCGCCT GAAACGGGCG CCGGCGCCGG 7ACTGTCGCG ACTACGCCGG CGTCGTCGGC 
GGTGACGT TG GCGGAGAGGG GTAGCAGGCT GCTC7ACCCG CrGTT<:AACG TG7G^'?C~ 



120 
180 
24 0 
1 0 0 
!>" 5 



GOO 



C C T T . C A G GAGAGGTATG G JAAGG'l GAG G AT C AC CG CT CAGGGGACGG GTTCTGGTGC 4 0 
. _ _ • 

: - ; -j-ct^ggagg gctatgtgt .: 4 - . 
ggaaggtgat ATGGG^GrGc aga.agggg ct gatgaacatg ■id?/?:^-::.:;, TCTCGG~~'" 
ggaggtgaag tacaa cctgg gcggagtgag cgagcacgtg aaggtgaagg caaaagtcct 
ggcggg gat 1 tacgaggg2a ccat 2aaaac gtgggacgac ccgcagatgg ct3cgctcaa 

ggccgggttg ggcacgacgg tggacttgcg ggcg:;t3cgg ggtggg,— ~~ gtgagaagg , 



WO 9i) '42076 



l'( I I N9<> 0326S 



:'G3CGCTGAT GATG3C3GTG GCGGT3TCTG TA3GAGC3GG GCTGGTGATG GTGGAAGGGC 



I 6 d u 



GGCCGAAACG GTTGGCCGAG GCTGTGGGAA TAGTGGTGGA ATTGGTGGGC GGAATCCCCA 1-4 0- 



GCGTGGTCGT CGGTT TGTGG GGGGCVATGA GGTTCGGGGC GTTGATGGCT CATCACATCG 

CTCCGGTGAT CGCTGACAAG GGTCC:;GATG TGGCGGTGCT OAAGTACTTG CGCGGCCACG 

GGGGCAAGGG GG^JGGGCA r :G TTGGTGTCCG GGGTGGTGTT GGGGGTG AT G GTCGTTCCGA 

™ATC3CCAC GACGAGTCA7 GAL'GTGTTGG G jGAGGTGG'."' GGTGTTGCCC CGGGAGGGGG 198 0 

G 3 A 7 C G G G A A TT^ 

2 :::format:g:; - -g-> ; 1 l- b • 

. i ! ^!-g::k:;'^- "i-MPA-'-p-Ti a; : 
>A> :GG;GTH : a-mr- 
(3) TY?S: cj;ui::t) ac\c\ 
(C) STRAiJT;KD!;EGS : 

(D: TOPOLOGY: linear 



: :icgk:;g^ 20; 3 crip 



l.'-v.i !,'ju G-rj A 1 1 A"! .3 Al ; ) G I. 



Thr Ala Al:i Pre 

P:,; S^:. GO/ G^r 
3C 

Th: Pro a:.,i 3^: 



Tvr Pn: 



1800 
I860 
1920 



1993 



WO 99/420~6 



vci. i syy.»u26s 



155 



170 :75 

Giy Thr Ala Val Val ?i j Leu His Ary Ser Asp Gly :>>- G]y Asp Thr 
180 135 19Q 

Phe Leu Phe Thr Gin Tyr Leu Ser Lys Gin Asp Pro GIu Gly Tip Gly 
195 20 0 205 

Lys Ser Pro Gly ! J h- Giy Thr Thr Val Asp Phe Pro Ala Val Pro Glv 
210 215 220 

Ala Leu Glv G 1 a Asu Gly Asn Giy Gly Met Val Thr Gly Cys Ala Glu 
22S 235 240 

2-i'-> 2 50 2 55 

br ' r Ar -' 7 {J -V : ' — 1-y Glu A: a Gli: Leu Gly As:: IJor Ser Gly As:: 

- dJ >CS 270 

.e.o siu Leu 1 r ; Asp Al.-t CLlu se: lie Gin A^a Ala A : a Ala Giy Phe 

2 7 s - y 0 2 8 s 

r - Qn ---^ 3U0 



WO 99/42076 



i'( 171 ;S99 .03268 



156 

G7A3GGAAAC G3CAACGCGG GC3GGT:3G3 CAGGTTGjTTG A 3 33 77 3 ATT TGGA7AArA7 :SC 

CTCGTGGAAG GTGAT3CCGT GGAATTGTGG C7:GGGAAC3 CTGGGGACGA GGCCGA7GCG 2 4U 

CTGCAACCCG GCAGCGCCCG TCGTCAACGG GGATCCCGTT CACCGCGACG GCTTGCCGGG 3 00 

CCCAACGCAT A 3 CAT T AT TG G AA C AA C 7 G T TGTATAGTTT GTCAACGCT3 GCCGCTACCG ^0 

agcg:cgcac aggatgtgat atgcgatgt o. TGOCCGGAGA gacaggagcg aggccttatg 4on 

acagoattog gggtcgagcc ctacggggag cggaagtagg tagaaatcgg :gggaagcgc a ho 

atggggtata tcgacjaajg caagggtgac 330AT07777 ttcagcaggg caactccacg 540 

TC3TG7 OA 07 '1'GTG j-'.I ..'AA CATCAT^.^G GA7TTG3AAG GGCTG T3 7 "3 303337333"^ 600 

7333A77T3A 7 7 -3 0A'7 ^33 33G37G33AG AAG373A33G 0A77G33A0O: CGAGCGGTA7 ^ r , 0 

GA337337A3 TGGTGGTGCA G3AGT33G30 7703333773 GG77G3AGT3 GGGTAAGGA3 7 8 0 

:AT33C3A33 GAGTGCAGGG 3A733337T7 A00O0AAGG3A T7GTGAG7CG GATGAC3TGG 84 0 

GAGOTGAGGG ACGAGGAAA7 0-AA7 "\A07A7 7303333:70" T"GTGAA3( 0''" 733rOAGGA7 100' 



WO W42(T6 



PCT/l'SWUIfcS 



)rmation fcr seo 10 ::o:i.sv- 

SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3 24 ba<je pairs 
(3) TYPE : nucleic acid 

(C) 5TRANDEDNESS : single 

(D) TOPOLOGY . linear 

SEQUENCE DESCRIPTION: SEQ ID NO : 1 S " 



r 0 0 TT A C j C GO 0 T 0 0 S r ' 1 'J 0 OS Z " 



vTCAGGGAG 



00 OGGGGAC \ j 0 TT S 0000 A* 



att oaga 



(A; LENGTH: 1 3 H baco r>a ; 

(E) TYPE. ;mci.^r acid 

fCO STRANOEDITE^S : : ng : >? 

00 00POLOSO: ' ::>- ir 



WO 99/42076 



158 



PC I I S<>'> 03268 



GGAGGGGGGC GGTGGAGGC3 GTGC'I'GGACG GGGGGCAGAC GG7GGTG2TG GGTAAGGGGG 72 0 

GGATCGGGGA GAAGCGCTTC GAGGTGGCGG CCCACGAGTT CTTGTTGTTG CGGACGGTCG 7 80 

CGCACAGGCA GGCCGAGCGG GTTCGCCCCG AGCACCGCGA CGTGCTGGGC CCGGCGGCCG 
CCGACA3CAC CGACGAGTGT GTGCTACTGC GGGCCGCAGC GAAAGTTGTT GCCGCACTGC 

cggttaaccg gggagagcgt gtggacgcca tcgaggatct ccacatgtgg accgccgagt 

GGGTGGGGGG OGACG3G2T2 GACTTT GGGG CCAAGCAGAA ACTGGCCGTC TTGGTGGTGT 

gggggatcgg ggtgg^:3ag ggggtg :3g: tcgggcgtag gggcgagtag gggggttgca 

CCAGCTGGGT GGAG ' ^'GGGG OTGACG'"' "G."* ^GTT-Grrr:^ .------vv- ~ ---.^--^^ 



840 

900 
96 0 

l. n 2 o 

1082 

:. i -i g 
:20c 



...7 2. ; ' '''.y-' z^Gzrc-^:^] agggggtgg': ttgagtgg3g gggatg3g: 



TG 3CGTGAAT -r.rr.r;r GCG GGC AACAGGA GGA1TGGGGG CGCGATGCTC CGCGCGGCCG 
GGGG'GG AGC'G GTAGA A 

imfcrmatioi: skq ig moii,9: 

: : 1 GEOUENGi: CHARAGTKR T GTGS : 

(A; LIGIGTH: 32 1 base pairs 
uG TYP-: nucleic acid 
iC: G TRAi JD EDIJE G S : siiu:ie 
(Lv TOPOLOGY; : men: 



1320 
1338 



WO 99/42076 PCT.TS99/032G8 



ix^; sf.q\:ki:cx DEGCkiPTi-N : sko :r> no : 160 : 

GAAGACCCGG PCCOGOOATA 'I"GATCGGCT CGCCGACTAC TTTCGCCGAA CGTGCACGCG 
GCGGCG^CGG GCTGATCATC ACCGGTGG™ ACGCGCCCAA CCGCACCGGA TGGCTGCTGC 
CGTTCGCCTC CGAACTCGTC ACTTCGGCGC AAGCCCGACG GCACCGCCGA ATCACCAGGG 
CGGTCCACGA TTCGGGTGCA AA3ATCCTGC TGOAAATCCT GCACGCCGGA CGCTACGCCT 
ACCACCCACT T3CGGTCAGC GC :TCGCOGA TCAAGGCGCC GATCACCCCG TTTCGTCCCC 
CAGCACTATC GGZTCGCGGG GT TGAAGGGA OGATCGCGGA TTTCGCeOG'J 7GC0CGCAGT 



TGGCCCGCGA TGCCGGCTAO GAIGGCGTOG AAAT^ATGGG 
ATCAGTTCCT GGCOCCGCGO ACCAAOAAO" GOACCGAOTC 

accgtcgcog gt 

(::; information for seq ;o n*j : ;gi: 

\ i ) S F GjUENC I- CHARACTER 1 G T I C G : 

(A) LENGTH: 5 3C*arrar.o anas 

(B) TYPE: amino acid 
{ C } G TR ANTE DNES S : 

;o) TOPOLOGY: linear 

(>::; SEQOENOR LEG OR 1 POLIO: SKO IG NO : 1 6 1 
Pt>- A: ,t G.: ■ 1 1 r, \ ;.; ; o, 



6 3 
120 
180 
240 

3 00 

.3 G 0 

"•• "AO" iGOPA H 



Gin Ar:« A: i 



- u /if; P A '■' .Ti \; P'ji: Trp Ar:: 

5 T 



O : A:-: 



WO 99/42076 
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160 



130 

Cys Asp Leu He 
145 

Pro Asp Arg Tyr 



Trp Asp Ala Leu 
180 

Trp Gly Ser Ala 
195 

Val Gin Gly He 
210 

Ala Asp Trp Pro 



Pro Gin Gly Glu 

Val Leu Pro Gly 
260 

His Tyr Arg Arg 
275 

Leu Ser Trp Pro 
2 90 

Val Ala Lrfu Val 



135 

Gly Men Gly Ala 
150 

Ser Tyr Gly Glu 
165 

Asp Leu Gly Asp 

Leu Gly Phe Asp 

200 

Ala Phe Met Glu 

215 

Pro Ala Val Arg 
230 

Pro Met Ala L»u 
245 

Ala He Leu Ar^ 



Pro Phe Val Asn 
280 

Arg As n Leu Pro 
295 

As n Glu T' T "r Ar ~ 



140 

Ser Asp Lys Leu 
155 

Gin Arg Asp Phe 
170 

His Val Val Leu 
185 

Trp Ala Asn Gin 



Ala lie Val Thr 
220 

Gly Val Phe Gin 
235 

Glu His Asn He 
250 

Gin Leu Ser Asd 
265 

Gly Gly Glu Asp 

He Asp Gly Glu 

300 

Ser Trp Leu Glu 



Ser Pro Ser Gly 
160 

Leu Phe Ala Leu 
175 

Val Leu His Asp 
190 

His Arg Asp Arg 
205 

Pro Met: Thr Trp 

Gly Phe Arg Ser 
240 

Phe Val Glu Arg 
255 

Glu Glu Met Asn 
270 

Arg Arg Pro Thr 
285 

Pro Ala Glu Val 



Slu Thr Asp Met 



Leu ?he He Asn Al 

1 ^ 



Pre Gly A. 
33 0 



^ y Arg 



Arg Asp -yr Val Arg Ser 

'l AC 



-i 'j 5 




WO 99/420 76 
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161 



Gin Arg Ser Ser T 



43 5 "' /r Le '" P " Ser G1 ' J Leu Vai Ala Ala Phe Leu T-p 

440 445 ^ 



Ala Gin ?he Glu 



- jlu Ala Glu Ara tt« ^ h _ - „ 

450 g * hr ^3 Arg Leu Aso Leu 

5b 460 

Trp Asn Arg Tyr His Glu Ser Phe Tin r 

465 47Q Phe Glu S er Leu Glu Gin Arg Gly Le u 

475 480 

Leu Arg Arg Pro He n e P-o - ln M _ 

435 ° In Gly ^ s S ^ Asn Ala His Me, 

490 495 

^ ^ V " ^ ^ ^ S « ^ ^ Glu Glu val Leu Al, 

Arg Leu Thr ^er Glu - — ^ - 

-„ n ~ ,eu 



His Asp Ser Pro Ala Glv A— a^- 

535 

:!iFGRMATION FOR SEQ ID >; 0; ; 52: 

sequence characteristic - 

!A) LENGTH: 234 arr.mc a — *, 

:b; amine acia 

* ; strandeeness : 

; 0 ) TO PCLCGY : linear 

xi sec/je::;:h ^c:p"r— 

Asn Jlu "^r Ala ?r-: .—t , 

- ' ' ' '"' e ~ iJ, "' : -~ Jr:: Ala An Pr 



Hi^ Ala Gly Hi^ Cvs 
30 



' /J ^P -1 Al. i 

•15 



~- / lou ? rc A - , - 

^5 , . a ^ : n 

5 3C 



WO f >9/42fr6 



V( 1 I S<>9 032(>S 



162 



Asp Lvs Asd 7a I Val Leu Gl- A^-- - ^™ T 

lJQ ' ; A ^ Hl - J *=? Leu Ala Arg Arg Ser 

135 140 



Glu Thr Leu Glu His Thr Pro His Gly ^ ^ g Pro Val prQ 

160 

Has Arg Gly Asp Asp Arg Phe His Glu Arg Asp Pro Leu Has Ser Val 



175 



Ala Met Leu Val Se^ Pro Va^ n.i m , 

' ^ ro Va ~ G1 - Ala Glu Arg Arg Ala Pro Val Va 1 

180 190 

31;: His Gin Tyr His Val Val Ala Glu Val Glu Arg He Pro Glu Arg 



200 



j-lU Lys '. r al Ser leu Lou Ala A ■ a " ^ ~ ■ * - ^ 

- rt — jiV oer Ar~ 



:rp Ala Glu Leu Val Arg 

:2 ^> 230 



Arg Ala His Pro Asp Gin He Ala Gly His 



235 



24 0 



Cln Pro Ala 31a Pro Phe Gin Val Arg His Asp Val Ala Pro Gin Val 
24S 250 255 

Arg .Arg Arg Gly Val Ala Val Leu ' ■ -s A— r 

-a- *hr Leu Aia 
-6b 270 

LJhS Val ^° ^ His Ala Leu Pro Glv Asd Phe 

*- J 2 3 0 



;formaticn -cr geg. :o nq : 



.3: 



-> ^ v LLL r ^ _ --lARAGlLR 13T2 23 ■ 
.-rtj ^£!IGTH ; 2^4 base - 
3) TVPH: nucleic acid 
: C ) 3 THANE EDJfES S : single 
:l) TOPOLOGY. Ln.^ 
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(B) TYPE: nuciezc acid 

(C) STRANDEDNE55 : single 

(D) TOPOLOGY: linear 

SEQUENCE DESCRIPTION: SEQ ID NO: 164: 
TAGTCGGCGA CGATGACGTC GCGGTCCAGG CCGACCGCTT CAAGCACCAC CGCGACCACG 
AAGCCGGTGC GATCCTTACC CGCGAAGCAG TGGGTGAGCA CCGGGCGVCC GGCGGCAAGC 

ACTGTGACGA CACGATGTAG CGCGCGG^T rr~r^n— ^ 

uLU ^ T GCTCCA.TGC GCGTTGGGAA TTGGCGATAC 

TCGTGGGTCA T G TAG GGGG T GGCCGCGT^ <™r,*~^„ ™ 

...A.^^v.. ^GTG^ATTC GCC^GACrCG 

^-^.GTTGGAGC GGTCA'^^GG' 1 " ' T Ar , ~*' ^^.^^^ ^r,^ „ , _ 

^ . i'anATG^GG TTTCGTGCGG CGG7GAGTCG 

— — — ™GGGGAAC JGCAGCAGGT GGACGTCGAT 'JC~v* — - 
-uuACon^ ^CAGGTCGGG AACGTCGGTG 

AT " CCA2 " GGCGCA3C3T TGCC '-" :;T GCCCAATTCG GCACGAGGCT GGCGAGCCAC 

GGGGGATCAC CAAGCAACGC 7TGC~~^G~A — 



^ — — ' - > — rlun 4 



jw-o . .- \T C GG AG ATT GAj 



- *~ "i^ ..7 . .'Art ' " 



:ggatcgtca GTTCGGCATC cgggagacca 

rCGTGCCTTG ACAAGAACGG GCGCAGATGT 

JGCAGTTGT7 CAATGGC7GC GGGCTGCGGC 

"AGCAAGGA TGCGAGTCTC GACGAACTCG 

-* vj ^-— --j^ j^j.^j'JA IG^TZGGCGG GTGATGTTTG 

~ J " J - ~-»vj_GGGGAG 3 GGGATGAG T 

- -j^-^- . — a A 3 rTGGGGG^G'" ""^ ':t , ~'^ Tii"^"^ — ^ , , „ 



60 

120 
180 
240 
300 
3 6 0 
420 
480 
540 
600 



'3C 



» - J* IV. 



ACGC CGACAT' 



3 6 0 



WO 99/42076 



I'C T t S99 0326S 



164 



GCAAAGGCGG CACC^JCZGG ^crr^r*^* 

— -CC^TGA ACAGCCTCGA CCCGCTGCTA GCCGCCCAAG 

ACGGCGGCCA AGGCGGGAGG ,r,rr^^^^ 

^ou^c Cwr^GCCGGC CGCACCAGCT 
.CACCCAACG CSCCGACGCC AAC3CCGGCA ACGGCGGTGA CGGCGG.GTC GGCGGCAACG 
GCGGAAACGG CGGAAACGGC GCAGACAACA CCACCACCGC CGCCGCC 
(2) INFORMATION FOR SEQ ID NO : 166: 

(i; SEQUENCE CHARACTERISTICS- 

(A) LENGTH : 3 04 base oairs 
(3) TYPE: nucleic acad 
!C: 3TRANEEDNESS : single 
TOPOLOGY: !; M ar 

3SCU22;CS ASCRIPTION: SEQ 12 N C:1 6o. 

-^^v_ ATGGGCGGGC AGGGCGG TAG — — _ 

— — — -GCTCTACCG GAGGCGCCAA 

j^'jv3CCCAC GGCTTCACTC CAACCAGCGG — - 

— — ooCGACGGCG GCAACGGCGG 



- u^oGC AATGGC3GCA AC GGCG< 



^ .^^LGGGCC AAA~~GTAA^ — — — — ,~ 

-■- -i^^ ---o^^C rt ACGGTGGCGC 



« FO RMAT T G N v C R , EC :D 



LENGTH: .439 .asc - 
•3: TYPE: „cle:c acid" 
'C 1 GTRA!JTJEDNESG ■ :- rq ' e 
G % — p~r\~v. • 4 



— * --jw. rt . JOi , _JAGG.v - 



b 0 
120 
180 
22 7 



120 



* - ~j ^ j ju-AAGC _ _ 



10 0 
104 



WO 99/42076 I'C r l'S'>9 0J26S 
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GGCTCGGCGC CCGA^GGAG" - -JTJ 

*' AG ' -ATA^..^ _GCGACCA GGACCCGGAC GTGCAGATGG 

ACTACG7GA7 GGCCAA7CCG CC3T7CAACA 7CAAAGACTG GGCCCGCAAC GAGGAAGACC 



CACGC7GGCG C7TCGGTGTT CCGCCCGCCA ATAACGCCAA C TA CG CATGG A7TCAGCACA 
TCC7G7ACAA CTTGGCGCCG GGAGGTCGGG CGGGCGTGGT GATGGCCAAC GGGTCGATGT 
CGTCCAACTC CAACGGCAAG GGGGATATTC GCGCGCAAA7 CGTGGAGGCG GATTTGGT 
CCTGCATGGT CGGGTTACCC ACCCAG CTG7 TCCGCAGCAC CGGAATCCCG GTGTGCCTGT 
GG77TTTCGC CAAAAACAAG GCGGCAGG7A AGCAAGGG7C 7ATCAACCGG TGCGGCCAGG 
7CC7G77CA7 CGACGCTCG7 3AAC7GGGCG ACCTAGTGGA CCGGGCCGAG CGGGCGCTGA 
CCAACGAGGA 3A7CG7CCGC A7CGGGGA7A CC77CCACG~ 1AG~AC A 



480 
540 
600 
660 
720 



GGTAA7GGGG GCAC7GGCC7 CAACGGCGCG GGCGG7GC7G 
GCGGG3CCGG CG3CAACGCG GG7G7CGCCG GCG7G7CC77 CGG"A - ~~~r- G ~„ GCC ~ G „„ 
ACGGC3GCAA CGGCGGCAAC GGCGGCCACG GGGGCGACGG CAC3ACGGGC GGCGCCGGGG 
GCAAGGGCGG CAAC3GCAGC AGCGG7GCC3 CCAGCGGCTC AGGCGTCGTC AACG7CACCG 
GGGGCGACGG CG3CAAC3GC GGCAA7GGC1 JCAACGGCGG CAAC3GC7CC 3C3GGC3CCT, 

33GGCCAGGG GGGTGGGGGC GGGAGr — — — _ 

— ^^^o^.. .„.^g^G CCACGGGGGC GGTGCGACCG 

3CGGCGCGAG 33GCAAGGGC GGCAACGGCA ~AGGGG<:g:: 33GCAGCGGC TCAGGCGTCA 
-j^-ovtC^.^, ^o^oo^.^Co GCGGGAATGG JGGCAACG^G GGCAACGGG 

: ::;fcrmattcn for geq :d :;c.:6a : 

l SEQrjENCZ Gi iARACTER 1 3 T I G3 : 

'Aj LENGTH: 323 oase oan . 

."t?a:;ghlne3:; 



7an 
84: 

300 
36 C 

IG2 : 

1080 
1140 
1200 
12 6 C 

1320 

:. 3 a c 
: -i 3 - 



WO W42(r6 



PC I , l 'S l >9 (U26S 



[66 



information for seq id nc : 16 9 : 

.1) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 30 base pairs 
CB) TYPE: nucleic acid 
(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION. SEQ ID NO: 169: 

GCAACGGTGG CAACGGCGr;r i r^* ^r"" rr>^ ^.^.^-r^,^™-. 

AC^CmC^ _3TGGGGAT GGCCGGAGGT AA CTGTGG TG 

CCGCCGGGCT GATCGGCAAC 

\2' INFORMATION FOR SEQ ZD NC-l^ 

3EC T JENCE CHARACTERISTICS: 
I A/ LENGTH: S32 Dase cairs 
iB) TYPE : nucleic acid 
(C STRANDEDNESS: single 
(D; TOPOLOGY • linear 

X:: SEQUENCE DESCRIPTION: SE^ ID NO.; 70; 

GGGCTGTGTC GCACTCACAC CGCSGOA^" — ____ 

- ^ — ^o.^o^o o^^^^^^A^T ATCCAGCTCA 

-TTACCGTCC GAGG ACS GO'S SSATCAAGGT GC3GGTCAGC GCCCAAGGAA 

AC SGG CATS GAG GCCCTC'JTCG CCCGGCTCSG GCAGGATCCG 

^^jv_^vj\-j.. . ,~ * - • _ G S CC ATCCTGTGAG 



}TCAT CCAC: 



- J ^ .j ^ .._ 0 ^ 0 ^ _ _ .*\Cv_GCAGGGA 



60 
80 



180 



-w.Trt^^G^. ^ CAC J AG. GS AAGAC 




WO 99/42(P6 
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CGCGCCACCG 3CKCACCGG CGGAGCKCC ^AGCCGGCG GCGCCGCTGG CGCCATCGGT 

ACCGGCGGCA CCGGC3GCGC GGTGGGCAGC GTCGGTAACG CCGGGATCGG CGGTACCGGC 

GGTACGGGTG GTGTCGGTGG TGCTGGTGGT GCAGGTGCGG CTGCGGCCGC TGGCAGCAGC 

GCTACCGGTG GCGCCGGG TT CGCCGGCGGC GCCGGCGGAC AAGCCGGACC GGGCGCCAAC 

AGCGGTGTGG GCGGCACCAA CGGCTCCGGC GGCGCCGGCG" GTGCAGGCGG CAAGGGCGGC 

ACCGGAGGTG CCCGCGGGTC CGGCGCGGAC AACCCCACCG GTGCTGGTTT CGCCG 

C- INFORMATION FOR SEQ ID NO : i 7 2 : 

,i) SEQUENCE 'CHARACTER 1 3 TICS ; 

(A) LENGTH: 5 90 baap pa— ^ 
■'3 J TYPE : r.ucieic acid 
; C ! GTRAilDEDNESS : j e 
X) TOPOLOGY: linear 

> : SECUE.\*CE OHSOKIPTION: SEC} lO KG. 172: 

CCGAC3TCGC CGGGGC GATA 0 GGGGGT CA C CGACTACTAC ATCATCCGCA CCGAGAATC3 

^^-.G^o -AACOGCTOC GGGCGGTGCC GGTCATCGGA GATCCGCTGG CCGACCTGAT 

-TGAAGGTGA TCGTCAAC'JT GGG CTACGGC GACGCGAACT ACGGCTACTG 

GACGAGCTAC G CCGATG^G" GAA~™ ™ , „„„ 

^m.„._ 0 _ — CCGAACGTGC CGCCTCAGGT 



-'--.C.GGCCG 2CGG^ 



j - 0 -jTGGTGGCCj 0CGCAC2 
■ — v ■- \ : ! C AA C T A C G GGGTCGI 




JO 1 J*vG 



240 
300 
3h0 
420 
480 
535 



i AC AT C 



60 
12 0 

: 3 c 

3 00 
J 6 : 

4 2 'J 
4 60 



i:;f:'Hmat:g:j fgp. sh: :g :;c i — 



WO ')9'42(T6 



PCI I S { ) ( ) U3 2(>S 
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j 3 AAAA-A 3G AC 3 2 GT 3 A 

. . . . * - .r,jn .3Cj CAT 3 3 3 



;ecjence characteris/itcs 

Ai LENGTH. 219 Od^e ua:;; 
TV' TV?" r.ucle:.. ,c:j 

2 TEA? JE '£2 KE 2 2 _ ; ; j- 1 e 

E TSrOLSGY 



^ecyengi ees :t:^t: se: 



60 

120 

180 
240 

300 



AC3GTGACGG GGGTACTGGG GG3GGC3ACG GCGGCAAC3G GGGGAATCCC GGGTGGCTCT 

TGGGCACAGC CGGGGGTGGC GGCAACGGTG GCGCCGGCAG CACCGGTACT GCAGGTGGCG 

GGTGTGGGGG CACCGGCGGC GACGGCGGGA CCGGCGGGCG TGGCGGCCTG TTAATGGGCG 

CCGGCGCCGG CGGGCACGGT GGCACTGGCG GCGCGGGCGG TGCCGGTGTC GACGGTGGCG 

GCGCCGGCGG GGCCGGCGGG GCCGGCGGCA ACGGCGGCGC CGGGGGTCAA GCCGCCTTGC 

TGTTC GGGCG CGGCGGCACG GGCGGAGCCG GCGGCTACGG CGGCGATGGC GGTGGCGGCG 3 60 

GTGACGGCTT CGACGGCACG .\TGGCGGGCG TGGGTGGTAC CGGTGGC 40 7 

'2; I FORMATION EOR SEC ID flO: 174: 

1 2EQ33ENCC CHARACTERISTICS : 

(A} uENGTK : 46 3 oase pairs 

(B) TYPE: nucleic acid 

(C) STRAiTEEDNCESS : single 

(D) TOPOLOGY: linear 

fXl SEQUENCE DESCRIPTION: SEQ IE NO: 174- : 
JAICGGTGAG CGCATCGCCC TCGGCGGCAA GCGATTCCCC GOT ITCACCG AAGAACATCG 6 0 

TGCACGCGGC GGCG23GACG AGCCCGCTGC GCTGCGGCGC GTCGAACGCC TCCAGCAGGC 12 0 

GGATGGCTGr TTCCGrCAAC 3CATTGCGGC ACGG3C3CGG 3TCTTTGTGA TGCTCGACGA 240 

GG3C33C3AC GTCGGCGGC 3 AGGT~GT~GG GGATGTGCCG ;CGCAGGGGT CCGGCGCGAC ' ' ibC 



4 2^ 



UO W4207h 



GGGCGGCGAC GGCGTCTTTG CCGGT3CCGG IGGCCAGGGC GGCCTCGGTG GG CAGGGC3G 

CAATGGCGGC GGCTCCAC7CG GCGGCAACGG CGGCCTTGGC GGGGCGGGGG GTGGCGGAGG 

_^^o_CGCG GCTCGTGCC3 AATCC3GG~7 "lA.cr.ATGG AC AGCGCGGCCA AGTTCGCT3C 

■^TCA GGG3C3TA27 GCCCC3AACA GGTGGAACAT CACC CGAGT7 AGCGGGG CGC 



:::fcri-*iATI3n ccr sec id nc.:~-. 
i, sequence characteristics . 

.A, LENGTH: ::c Dase "air 5 
B: TYPE: nucleic acid 
3 : 3 T RAND SDNES 3 . :,;r t q 1 e 
70P3LCGY . Lir.oar 

. .;E3;~:;7Z legcr:?-: ::; ;c 
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GGGGGCZCGG GCGGCAATGG CGGCGCGGGC GGCAACGCGC AGGCGGCGGG GTACACCGAC 180 
GGGGGZACGG GCACCGGCGG CGACGGCGGC AACGGCGGC 
(2) INFORMATION FOR SEQ ID NO: 176: 

(1} SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 494 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY : linear 

ixi, SEQUENCE DESCRIPTION: SEQ ID NO:I76. 

TAGCTCCGGC GAGGGCGGCA AGGGCGGCGA CGGTGGCCAC GGCGGTGACG GCGTCGGCGG 5 0 

:AACAGTTCC GTCACCCAAG GCGGCAGCGG CGGTCGCGGC GGCGCGGGCG GCGCCGGCGG 12 0 

GAGCGGCTTT TTCGGCGGCA AGGGCGGCTT CGGCGGCGAC GGGGGTCAGG GCGGCCC3AA ig 0 



TCGGCGOZZG 24 0 



300 
360 
420 
480 
4^4 



;gccag ggtgggg: 



:ggtgccgg agcggccggc gccgacgc: 

j^.Cj^^jG _33GG3CGG2 GGCGTCGG' 



13 0 



WO 99 42076 PCT'i:S99/03268 
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(C) 3TRANOEDNES3 : single 
ID) TOPOLOGY: linear 

ixi; SEQUENCE DESCRIPTION: SEQ ID NO: 17 8: 
ATGGCGGCAA CGGGGGCCCC GGCGGTGCTG GCGGGGCCGG CGACTACAAT TTCCAACGGC 
GGGCAGGGTG GTGCCGGCGG CCAAGGCGGC CAAGGCGGCC TGGGCCGGGC AAGCACCATC 
TGATCGGCCT AGCCGCACCC GGGAAAGCCG ATT CAACAGG CGACGATCCC GCCTTCGTTG 
CCCCGTTGGA CCAGGCCCGC ATCACCTACG CTGACCCAGG CCACGCCATA ACGGCCGCGA 
AGGCGATGTG TGGGC7GTGT 3C7AACGGCG TAACAGGTCT ACAGCTOG7C GC3GACCTCC 
3GGACTACAA TC 



3c: 



IC^A uITCGCTGCC ATCGCATCAG 

-TCGAACA 

INFORMATION ?OR SEQ 10 NO: 179 : 

1 ; S E QlTENCE CKARACTE R I3TIC5 : 

(A) LENGTH: 400 base -airs 
(3) TYPE: nucleic acid 
IC' 3TRANOEDNESS : cinqle 
(O; TOPOLOGY: linear 

■ :<i 3EC T JENCE 0E3CR IPTICN : SEC IL NO 1^9- 

^ __CGCTGCTA GCCGCGCAAG 

AC G AG CT 



-A AGGGGG CACG GGGGGCACCG .CGGCAACGC CGGCGGCGGC 



.^■o -^-^voC. --iACGGrGG.A .-VJJGCGGTGA 3GGCGGGGTC jGGGGOAACT 

i OIA JCACCGC GGCCGCCGGC ACCACAGGCf 
j3 -3CG3CGGAA ;■; ;;;;;; jaA- IGGCGGAGCG 3CCG GCAC~" 



'n^rmatijn -cr se: :d nc : 



1 3 t j 



.-ECL'ENCE -TiARACTZRIGTIIG 
■A. LENGTH. 5 • ~ oaae lvi : * ; 

? type . — -: - - , ~- 



so 

12C 
130 
240 
300 
36 0 



60 



:4j 



WO 90/42076 ITT/i:S99/03268 
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AGCGGCAAC3 aC,CCCV.CC GCGGCAGC3G CGGCAACCCC GGCAAC3CCG GCATCGGCGC 
CAACAGCGGC ACC3GCAGC3 GCGACGGCG3 TGCCGGCGGG AACGGCGGCG CGGCGCGCAC 
GGGCGGCACC GGCGGC3ACG GCGGCCTCAC CGGTACTGGC GGCACC3GC3 GCAG CGGTGG 
CACC3GCGGT GACGGCGGTA ACGGCGGCAA CGGAGCAGAT AACACCGCAA ACATGACTGC 
GCAGGCGGGC GGTGAC3GTG GCAACGGCGG CGACGGTGGC TTCGGCGGCG GGGCCGGGGC 
CGGCGGCGGT GGCTTGACCG CTGGCGCCAA CGGCACCGGC GGGCAAGGCG GCGCCGGCGG 
CGATGGCGGC AACGGGGCCA TCGCCCGCCA C3GCCCACTC ACTGACGACC CCGGCGGCAA 
CGCCCCCACC GGC3GCAAC3 GCGGCACCCG CGGCACCGGC GGCGCGGGCA TCGGCAGC 
INFORMATION -CR SEQ ID NO: 13 1 

secjznce characteristics : 

(A) LENGTH: 2.19 base cairs 

(3; TYPE: nucleic acid 

(C) 5TRANDEDNESS : single 

(D) TOPOLOGY: linear 

X1 ^UENCE DESCRIPTION: SEQ ID NO : 131: 
3G """™ : ' JCC3CGGG? ^ — iCGCCGGAGG CGC3GCGGGT GCCGTTGGGG 

33TGGGGCTn ->C3GTGCCGG AGC3GCCGGC GCCGACGCCE 



— g..,*. ...oo 3GT3GTAG.. i jGttcgctgg isgggccggc ggcgteggc: 

" JvJLJVKn -^^o_'. ECCTGGTGCC 3GC0GCAE ? 



:: format ' 



E3R SEQ ID NO 18 2 



3eq:;e:;cz characteristics 

■A: LENGTH: 39 5 base ::a : 

V. — - ■ 

- CTEANDEDNESE 

- ?~t,scy ; : n ,. , : 

J ectenee ees et: ;\- 



120 
180 
240 
300 
360 

420 

480 

533 



WO 99/42(P6 



-CGGCGGCC ^CC, ACCCCGGGCC GGCGGCAGCG CCGGCGCCGG ^ 
3GGGCCGGC3 GGGCC3GCGG CAACGGCGGC GACGGC3GGG ACGGCGCAAC CGGTGCCGCA 
GGTCTCGGCG ACAACGGCGC GGTCGGCGGT GACGG.GOGG CCGGTCGCGC CSCCGGC^ 
CGCGGCAACC CGGGCGTCGG CCGACAGCC AAGGCCGGTG .CGGCGGCO, CGCGGGCAAT 
GGCGGCAACG GGGGCGCCGG CGGTGCTGGC GGGGCGGGCG ACAACAArr, CAACGG CGGC 
CAGGGTGCTG CCGGCGCCA AGGCGGCCAA GGCGGCTTGG GCGGGGCAAG CACCACCTGA 
TCGGCCTAGC CGCACCCGGO AAAG C CGAT C CAACAGGCGA CGATGCCGCC TTCCTTGCCG 

GGCCGGCA7C AGG7AGGG~- _ 

•■ ■-■ ^v^Yi'AACG GCCGCCAAGG 

-G GG7G7G7GG7 AAGGGG3TAA GAGG— A-r 

~ C3GCCTGACC AT GG A GAG GG GGGGGAAG77 C3CTCCCATC GCATCAGGCG 
TACTGCCC CGAACACCTG GAACA 

lOTORMATlCK rCR ZD ::o:i 33: 

-eque:;ge gkar.^gter:3t:gs • 

gengtK: : 133 bdt;e pair3 

■•3. x rv?E: nucleic acid 

."TRATJSEOTES^ . 5:::gle 
7GPCGGGY ■ linear 

^ . _ . j_ y j . 



--•-^o^.w.,^ j GAG-" . 3 J GG A77GG GGGGG 
-GGAAAGG GGG7AGGAGA ?GG^ — 



j/wv^rt^.^ GATGGGTTGG GGATGGGGGG 



420 
480 
540 

600 
660 

720 
780 
B40 

yuc 

96 0 
985 



" 7 - ~ ~G G GGGA7 GAAA 7 '"AG*""' ■ - - - ■ 

. -;,G7 GTAGGGACGA TAATGGG-rAT 



WOWM20-6 PC T rS')') (IJ2(,S 



CGGATGGGTG GAACACTTTC AACCTGACGC TGCAAGGCGA CGTCAAGCGG TTCCGGGGGT 
TTGACAACTG 3GAAGGCGAT GCGCCTACCG TTTGCGAGGC TTCGCTCGAT CAACAACGGC 
AATGGATACT CCACATGGCC AAATTGAGCG CTGCGATGGC CAAGCAGGCT CAATATGTCG 
C3CAGCTGCA CGTGTGGGCT AGGCGCGAAC ATCCGACTTA TGAAGACATA GTGGGGCTCG 
AAC3GCTTTA CGCGGAAAAC CCTTCGCCCC GCGACCAAAT TCTCCCGGTG TACGCGGAGT 
ATCAGCAGAG GT CGGAGAAG GTGCTGACT7G AATACAACAA CAAGGCAGCC CTSGAACCGG 
^^lAACCCGCC GAAGCCTCCC """"'"Gr , ~A' T, ~ * i-^™^^,^,^, 

— ^ — — tw.ccc gcccccgcct ccgcaagagc 

AGGGATTGAT CGGTGGCTTC ~TGA-GC— — _ 

^^^"^ — — ACGG GTCCGGTGTG ACTC-G^TA 

GC 3GGATG'""'" -lGC^-"^ " ^ ^^^^^ 

J -^-^^CCGG 7TACCGGATC GCCGGGTGGT GGCCTCCCGG 

GTGACACGGC GGCGGAGCTG ACGTCGGCTG GGCGGGAAGC CGCAGCGCTG TCGGGCGACG 

TGGCCGTCAA AGGGGCATCG CTCGGTGGCG GTGGAGGCGG CGGGGTCCCG TCGGCGCCGT 

TGGGA7CGGC GATCGGGGGC GCCGAATCGG ^CCCZC TGGCGCTGGT GACATTGCCG 

GCTTAGGCCA GGGAAGGGCC GGCGGCGGCG CCGCGCTGGG CGGCGGTGGC ATGGGAATGC 

'-GA7GGGTGC GGGGCATCA3 jGACAAGGGG — — r ^ r^>~ 

J ^AAGGGTT— CAGCAGGAAG 

GTACAGGGAG G A T G G G G CA 7 3GACC3AGGC GG7CA7TGGT 

j --iAGTGAGGAT 3GACGAA77G GACGGGCA7G TGGC 



720 
730 
940 
900 
960 
1020 

:cac 

HOC 
I26C 
1320 
1380 
144 0 

i5o: 
156 : 

15 2 0 



WO W42076 



PC"! rS9«> 03268 



174 



(Ci STRANDEDNESS : 
(D) TOPOLOGY : linear 



(Xi; SEQUENCE DESCRIPTION: SEQ ID NO: 134: 

Mec Thr Gin Ser Gin T hr Val Thr val Asp am Gln clu Ile Leu ^ 



5 10 15 



Arg Ala Asn Glu Val Glu Al ^ Dm ^ 

20 ^ a Pro Met ^ Asp Pro Pro Thr Asp Val 



25 



30 



P« lie Thr Pro Cys Glu Leu Thr Ala Al, Lys Asn Ala Ala Gin Gin 

"* u 45 

Leu Val Leu Ser Ala .sp Asn Met Arg Glu Tyr Leu Ala Ala Glv Ala 

60 

65° ' ^ ^ a ^ Ser LeU Ar 3 As;: Ala Ala - /s Ala 

" 5 80 

^ *! ? G1U Glu Ala Ala Thr Ala Leu Asp A sn Asd Glv 

90 95 " ' 

?i:U T ^ ^ 3 GlU Ser Ala A - Val Glv Glv Asd Ser^ 

105 110 



.Lr^ Va. Ala Thr Ala Glv Glu p^- 

120 



A ^ 

Us 



Asn ?he ^e 1- " j*> 

130 ^ ^ ::: ^ a Ar ? ^ gi 7 as D 

140 



L 6 5 



Ar7 Phe 




WO W42(T(> 
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Val Asn Pro Pro Lvs Pro Pro o^o A]a r 

275 * ;;° 13 Ixe Iie Pro Pro ?rc 

285 



280 



Pro P,o oi„ G iu GIn 01y ^ ne pro Qly ^ ^ ^ 

300 



295 



Asp Gly Ser Gly Val Thr P-n ™ ^, 

305 ™* ^° G1 ' Thr Met Ala Ma Pro 

1U 315 



320 

Gly Gly Gly Leu Pro Ala Asp Thr Ma 
330 335 

»-a Gin Leu Th^ ^ ~ - 

Ser _a _y Arqr Glu Ma Ma Ma Leu Ser Gly ^ 



Val Pro Pro Thr Gly Ser Pro 
325 



340 



34! 



350 



Val Ma Val Lys Ala Ala 



Gly 31 v 



3b; 



^ ?r ° L - *-Y Ser Ma lie Gly Glv M, 



Glu Ser Val Arc; 



33 C 



385 



Ua Gly Ma Gly Asp 
390 



Ala Glv 



Y Leu Gly Gin Gly Arg Ma Glv 



395 



Sly Gly Ala Ma Leu 
405 



Met g: 

410 



400 

Met Pro Me- Gly Ma 
415 



^ His 



410 



- yj G^n 
* 2d 43 0 



3D Glu Ma 
4 3 5 



— ' — ^ -A o I.) 



a Tro Thr Glu 



.a , i . ; 1 e 



-"'J ^rn 



4 50 



46C 



INFORMATION FOR 0 ~Q l' 1 

^ *-PGLCGV 1 ; n 



13^ 



WO 99/42076 



K T,TS99'(U2(>N 



176 



Phe Pro Pro Ser 

- ~ - -^ii ucu nrg Arg Arg 

60 

'hr Thr Arg Arg Ser Gl 



A ^ ^ 9 ° s ln LeU val Ser Arg Glu Ma 



" ~ ^ *** H ^ Arg Cys Gly Trp His Pro 



70 75 

° 80 



Gly Thr Gly Ser His Thr Gly Ala n 

85 7 ^ a Vd * ^9 Arg Arg Hxs Gin Glu 

Arg Asp Gin Ser Leu Leu l, 
100 

Gly Gly Gly Arg Leu Arg Arg Val TVr Arc » 
115 

Val Val Phe GIv 

^ n ni _ s ^ eu ^ eij ? 

13 5 

His Arg Glu Asn Leu Val Ala Glv A - 



85 * ^ y ftL y ^3 "is Gin Glu Ala 

100 ^ ^7 Arg ^ Val Asp Leu Asp 

105 110 

he Gla Gly Cvc Leu Val 
120 125 

130 _~ " " eu :j,?1J *le Leu Arg Val 

n 



155 160 



rg Arg Val Phe Arg Val Lvc Pr 
155 

Phe Glu Pr-} Ag- — v ^ ^ u T1 

" ' a ~ ?he Ue Arg Met Phe Pro Pr 

lb a 

Mio Val Gl n ^eu Art; \s r — ^ - „ c 



" '-a ™e ^ro Pro Ser Pro 

U 17 5 



130 ; q : Leu Gly ^5 Ser Ala 

190 

155 ° ill ?r " ~ eu Le - — s Glu Arg Ser 

205 



A. a Ser 31-.- ?^>- 



;'al Val Lvg Pre ^> o 
J I 0 



-isp Val Ala Leu j ' - > — - . . - 

~'- u ^er Val Pro Pr^ 

" 2 0 2 3 5 



240 



WO 99/42076 



K T I S99 IU26S 



177 



Arg Vai Ala Ala Ser ?he a e ~ -p 

1 5 * G - e Ai P rr ? ^ Asp ser Pro Asp Se^ P ~ 

1-eu Asp Pro Ser Le 
2C 

30 



r Ser S « ^ I-u Asn Ala Val Ser Cvs Civ 

25 3Q " ^ 

^ Glu ser Ser Ma Ser Ser Ser JQa ^ ^ Gjy ^ ^ ^ ^ 

40 45 

Trp rhr Ser Met Pro Ser n v tk * 

50 ^ ° l J Thr Ar ? P ^o Gly Pro Arg Arg Ala Thr 



50 

7Q ■"- — ^ e p ro Ser Arg Arg 

9 Ala Glu ?he 31 v Thr 



Ser Arg Asp Asp Arg Arg Ser Ala Thr Ser Val I 

75 



30 

al Ala Pro Ar 

35 ^ i ' Ue ^ xV Thr ^ Leu Ala Ser Hi:; Arc: 



■i Ser Pro Ser Asn Ala Cvs Pr. 

105 



-"/s Pro Val Arq 1 1 V;i ' m « »- c„ * - 

100 „ _ ^ ^~ — r s er A^a Ser 

110 

31/ Arg p„ :le Sfir 3er ? ^ ^ ^ ^ 



115 — ~ ^ Ser ^9 Ser Cys Va 

0 125 



'^rq Ar~ 



Asp ;vs A.qn Glv 

130 * ~ d A * a S ° r G - Arg Arg Leu Asn Arg 

1 4 G 

^-a Arg Ser S^r ^ * « - ~ 

rt^a K^a Arc: ~- - _ 
14 5 , --^ ^rg ,hr Glv Thr ?h- 

~ 3U 155 * ltJ ^ 

Arg Arg Ser An 



* ier Ser Men Arc 

- o 5 „ _ ~ 



Jd: Thr Asn Ser Pro 



,a^ Ala Pr v ^v,^ ^ 

130 — Ara LP Glv :iv 

190 



: ■ arr.: 

q i — v r r 

- - * fc - ■ Jin::;:: 

- P . 3 7RAN7J EDNKS 5 : 



WO 9<)'42(r6 



l , C'T/i:sy9.'(»J26S 



[78 



35 

4 j 

Arg Asn Pro Arg Arg Se- Ar- a™ , 

50 * ^ Arg a Glu Asp Arg Arq Vai 

3J 60 



Phe Ala Ala Thr Val Ma ^ ^ ^ ^ ^ ^ 

75 

5 80 



«V «X »« «. J, «» U. lie ^ la „ „ r v,: c„ ^ „, G1 „ 



85 90 

95 



Ala Gly Arg Val Arc 



rg Arg Gly Gin Glu Leu 



100 HlS Hls ^ His Arg His 

05 110 

Sis ?r ° AS? ^ £j ^ Arg ^ HlS Arc, Ar. Val 

130 ^ ^ " 3 ^ Ar, V. _ 3l ., , eu Ar? ^ T .. r 

140 



Thr Ala His ^, <- * 

15C 



145 PhG Ma A - Thr Asp Arg Val Arg Hi, 



155 150 



Val Arg Gly Pro Ser Asn 

155 ^ _ ' — *yi Arg Gly 



Arg Arg Arg Arg Val Tvr Ar: 
170 



1^5 

Arg Hu s er 



180 " " "" /S i>r ° ^ a G:y Gly ^ G: V ^er Val 

IBb 190 



--rCPKATIC 



SEQUENCE CHARACTERISTIC" 

|a, 1£;;gth: -i ^^^^-5 

■IB) TYPE; amino acia 
'C' 3TPJUJZEDNESS ■ 



WO 99/42076 



PC I I N9M 03268 



[79 



65 



70 



Ala Glu Tyr Arg Asd Arg Arc: w o 

85 ^ d 9 ^ Va " ?ro V ^ Gin Arg Ala 

90 95 



«. <U. «■ Ma ^ ^ ^ ^ ^ ^ ^ ^ 

105 110 

H^s Leu Arg Pro Phe Leu Ser Thr Ha- G^u a™ » r 

GxU ^9 Asp Leu Thr Mec G^v 
12 0 . 



115 

31 y Glu Glu He Glv 
130 



Leu Pro Asp Ala Glu 

1.15 



125 

Val Thr u e ^ g Thr Giy 
140 

Gin Ala Leu Leu Gly Asd Ala Arc- T-n a1 . 

' J lr ? - eu Ala Ser 7^1 _ 
150 — 

^ 5lJ 160 



2er ;Ca Arg Gly r^- 



Leu Arg Ser Ser Arc: 



.i.rr; ^e\_ 



M.sp ;di A^a 
175 



190 



r Arg Glu Val Ala Arc: ; 



185 



g Arg Gly Pro Gly Arg Val 



L90 



:o Asp Gly lie Asp Val Hi 
155 



^ ?hG p « -u Ala Asd 

- CC 205 



Asp Asp Ala As 
210 



P Asp ser Ala Pro Has Glu ^ Al , - 

^ ^ — ^ ys Arg Leu 



■ * .-vcn 

12 5 



.^y ^ u Ser Ser Gin Ser 
235 =40 



■-■ Thr Arg Asn 31 y A 
260 



■1 <^a Leu Hu Arg Val V 



al Thr Leu 



3 :5 



— " -"K Ala Asr 



WO 99/42076 
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180 



;xl) DESCRIPTION: SEO ID ND:139 : 

C7CGTGCGGA TTCGGCACGA GCTGAGCAGC CXAAGGGGCC GTTCGGCGAA GTCATCGAGG 
CATTC3CCGA CGGGCTGGCC GGCAAGGGTA AGCAAATCAA CACCACGCTG AACAGCCTGT 
CGCAGGCGTT GAACGCCTTG AATGAGGGCC GCGGCGACTT CTTCGCGGTG GTACGCAGCC 
TGGCGCTATT CG~CAACGCG CTACATCAGG ACGACCAACA GPTCGTCGCG TTGAACAAGA 
ACCTTGCGGA GTTCACCGAC AGGTTGACCC ACTCCGATGC GGACCTGTCG AACGCCATCG 

AG CAATTC G A CAGC7TGCTC G C^GT— ~ rr^~^^ 

^CT^^ GC_GTTCTT GGCGAAGAAG CGCGAGGTGG 

TGACGCATGA CSTCAATAAT C?CGCGAC~G - A r-r^r 

J ^AC^CGAC ',j . 7GCTGCAG CCCGATCCGT 

^ tt"-^t^ » ^ , ^, rt „„„ 

^ ~ ^C\^, . ^'j^ joCJ/VvC ATTAACCAGC 

*-\C^t\^ w ^ — - GGCGTGGTG™"' ^■^^n— rw^^.^,^,^, _, WT ,__ 1 

- ^.O.l.AAT TTCGCCAAC2 

'-.j AT I 'JAGG CGGGTAGCCG GCTCGGTTAT CAAGAGTCGG 

— ^SCGGAGTAT CTGGCGCCAG T"^T~a~~~ - 

^- - on* ^_.-\AGTTC AACTACTTTC 

AGCACCGCCT GGACACTGCC TAAAGAGATC GCGTACTCCG 



• j jov. _ . .jAACGTGG 

AGCGGGGGTT GCAGCGGC: 



AA^.^.: ^v^AGAGCAC GGTGCGGGGG ATGTGGGTC 



- -^CGCAG r 



a^o_oiA -GACoAGTAC 







jGGGGGTTG ggtggtggga cccgggaz: 

~TTGCTGAC G2G3GAGTCC CTGGCGGA; 
: ' - - -j-AAAGGGGG GGCGGAC"': 

-..nATGGG TTTACAGGCG CGACAGGT: 



60 
120 
180 
240 

300 

3 fa" 0 
420 
480* 

S4C 
600 
660 
720 

7 30 
94.C 

j 0 0 



* * ■.--•/\v_ - ACATGGG Cr ' 



WO 99/42076 



PCT IS99 



TGCCGACCGG GCTGGCAATC ATGGGCATGG GCATGGC— ■ — - 

^ T ^"^' CTCCATGATG CCACTG7CCG 

GGGCGGCAGT GCAGACCCTG GCGCCACATC AGAT^r~r~ 

AGAT^C.C,, ^TTCGACG CTGATCAGCG 

TCAAGCAGCA GCTGGGCGG7 T CC^ TO CCGCAGTGAT GTCGGTGCTG CTCAC^c 
.GTTC^CA OGCGAMTC , rc=crACTG CAAAGAAAGT COCC^C CCAGAGAGTG 

GCGCCGGGCG GGGGGCGGCG GTTGACC— rr^r— ^ ^ 

— CAC^._ CCTCG^TACC GGGCCAAACC AACTTCGCGG 

_^C__ A^GGTGGT ATTCGTGATA GCGACCGCGC 
-^.c ,..^. jG CAT 7CCTGCCGAA ACAGCAGGCT AGTCATGGAA 
GAG CA C C G TT GCTATCCGCA TGAOGTCTCG — 

z information fcr sec. :c nc:: ?0 . 

i'- 1 SEQUENCE CHARACTERISTICS- 

(A; LENGTH: 19:3 base M j r s 
f3) TYPE: nucleic acid" 
; :'C) STRAND ED NESS : single 
TOPOLOGY: linear 

:>U 3SCUSNCS - E ^-"~- sec :d ^c:-c- 



■ -GGA GAAGTCGTT" GT r 
— -' - ^'w- ^ JrtL.. jmGGAT AA> 
-C JTAC COT CGGTGACGTT 

CTGAGGC CTTGCGCGCG A A " 
G7GCG7TT0 ACGCACATGG 



168C 
1740 

laoo 

I860 
1920 
198C 

2040 

2072 



TGG TCGAGA & - ^ 

TCAAGATCC~ "ACGAGGAC CTCCCCGGT- 



12 0 
13$ 



^-M.'oj^Vrt. j-wyC_ ^G^jAGG 

~ o v_, CA CG AG C AG AT 0 ^ s r 

' ^ - -j^- * w iG CCA 3 C 0 



— ^- „ j *v\jAACGACGA TOG "GG""\ m 

:CCTCT CGTAC'CGCTG soc 



WO 99/42076 .'C Kl N99 

182 

GACC3C3CGC AGGTCGCGAG TCAGCAGCTT TGCGCCGGCA GCTTTC3CCG TGAAGCCGAC 
CAGGGCATC3 TAGGTT3CGC CACCGGTGAC ATCGTGCTCG GCGAGGTGGT CGGTCAAGCC 



GCGATA7GAG CAGGCATCCA GTGCCAGGTA GTTGCTGGAG GTGATGTCCG CCAAGTAGGC 
GTGGACGGCA ACAGGGGCAA TACGATGCGG CGGTGGTAGC CGGGTCAAGA CCGAATAGGT 
TTCCACAGCC GCGTGCGCGA TCAGATGGAC GCCACGGTTG AGCGCGCGCA CGGCGGCCTC 
GTGCCCTTCG TGCCAGGTCG CGAATCCGGC AACCAGCACG CTGGTGTCTG GTGCGATCAC 
CGCCGT3TGC GATC3AGC3T TTCGCGAACG ATTTCG TCGG TCAACGGGGG CAGGGGACGT 
TCTGGCCGTG 3SACGAGAAC CGAGCCTTCC CGAACGAGTT CGACACCGGT CSGGGCCGGC 
^" — CTCCGTG .-.TGTCGAC™ GGTCGTTGCC GC3CAAGCCA 
AGGCGCTCGC GAATC C3CTT JGGAATCACC AGAC3TCG~3 CGACATCGAT GGTTGTTCGC 
^-^-.^o,^ . .,v__.-._. oCACGTTCCA TAGGCGT37C CTGCGCGGGA TGTCGGGACG 
^-^.^ UlA4 -GAACG ATTGTTTCGG AAATGGCTGA GGGAGCGTGG G3TGCGGGTG 
ATGGGTGTGG ATGCCGGGTT GAG^~GA— ^ 

U ^^^ C - TCATCGAGAG 7GGGCGTGGT 



:gga tgtcgacgtg itgggcacac 



:~a -re: 




ACG 

- :::r:?j-!AT:-r; jr : 



— p.; 



.ir.ear 



900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
139<J 
14 4 0 
ISO 0 
1550 
1620 
1680 



~ -j^- GAT GAG GGATGCGG.G GAG2AGTGGG TGGACACCCA 7GATCCGGAG -40 



130C 

: a 6 r 



3Ecge:;g™ :.^c?.;?t: 



3e: ::; \-r 



WO 99/42076 
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183 



cgc^ggtt ^co^c gg^ggctag gtcgaaaccc gcgccagggc CGCTGGAC3 , 

CCCATGGCA GCG^G 

AGTGCTTGGT GGTTCGTGTT TAGCCATTGA GTGTGGATGT GTTGAGACCC TGGCCTGGAA 
GGGGACAACG TGCTTTTGCC TCTTGGTCCG CCTTTCCCGC CCCACGCGCT GGTGGCGAA* 1 
-GGCGAG, CGGGAATGCT CGGCGGG7TG .CGG^CCGC TCAGCTGGGG AGTGGCTGTG 
-CACCCGATG A^GACCA CGGGCGCC, GCGCCGGACG ACGGCGCCGA TGTCGATGTC 

^AGGCGGCCG AflGGGGCGGA CGCAGAGGGC GCGGCCA— V— , 

^^u.uo A^GAGTGGGA TGAGTGGCAG 

SCGTGGAACG AGTGGGTSGC GGAGAAGG.~~ GAAP— ™ _ 

_GCT ..GAGGToC^' ACGGAGTAGC 

._ wjCATT;: rci-JGCGGG: GGCTAGGAGA GGGGGC3CAG AC7GTCGT7A 

TT7GACGAG7 GATCGGGGG7 GrCGG^G— r— — ~~ 

— C.^oGCCGG 7 7 A7 G A CAA C AG7CAA7G7G 

C ^ CAAGT TACAGGTATT A — — TCAACAAGGA GACAGGCAAC ATGGCAAGAC 
GGA7CCGCAC GGGA7GCGGG ACATGGCGGG CCGTTTTGAG GTGGACGGCC 

AGAGGG7GGA GGACGAGGC7 GGCCGGA7G'" GGGC"— — 

— ~~ oCAAAACATc: TCCCCNGCGG 

vj- Urtu * JVI -^GGCGGAG GCGAC777G7 TAGAC 



::;rcR>:AT:e:j 



SEQ ID :;c : 132 



^ITENCE C}tARAGTER:GT:rs ■ 
-OPCLCGY. linear 



300 
360 
420 

480 
540 

600 
660 

7 2 0 
"30 
840 
900 
960 
102 0 
*1055 



SEQUENCE DESC.-.ir^'GV 



sec :r. :; ; ■ 



..CI/IS99 0326S 

184 

(B) TYPE : nucleic acid 
(C; STRANDEDNESS : Single 
ID) TOPOLOGY: i;nej r 

SEQUENCE DESCRIPTION : SEQ ID NO: 193. 

AACOGOCCCS WGCCACCOC TCCTCT*** GCTCTCGTTG GTCGCATGAA GTG CTGGAAG 

GATGCATCTT GGCG.TTCC CGCC^GC ^CAGCCGC TAGTCCTAGT CCGACTCGCC 

CGCAAAGTTC CTCGAATAAC T-^T.r-^ ^ 

^TAAC T^GTACCCG GAGCGCCAAA CCGGGTCTCC TTCGCTAAGC 

TGCGCGAACC ACTTGAGGTT CCGGGACTC- ^GAC-T— 

- ^AL^T^ uAGCGATTCG TTCGAGTGGC 

TGATCGGTTC GCCGCGCTGG CG— st— — 

-^^l^jAG^u -GGTGATGTC AACCCAGTGG 

<j\~t\ .■ALiAovj 4 .j, - '^^-GAGCTGT ^*^*^"— 

:^fcrmat:c:; pgr skq :d :; C;134; 

5E^JH:-E rHARA-TEP.IGncs- 
CAj LENGTH: 67 3 amine V^ds 
(BJ TYPE: ammo acid 
■:c: STRANCEDNESS : 

TOPOLOGY: linear 



oEQ-jhnce ce3cr:p7:-> : . 

*V Glu Vdl He Glu Ala Phc Ala 



3EQ ID mc ; 194 



^ Gl v Lvs ■ - ' ■ ■ ^ ■ 

- ' * - - ' J ^ - ~> - - s As n 



ii As n 3 e r 

30 



.Vji: A. i A _ . 

, ^ — -.^ rir ^ A 3D ? h e ?b , 



- <-i - '/a 1 - r :t Sor Leu A. ' i ~ ->v- 



Asr. Al a 



Asr. Ala He 31:: ~] 



60 
120 
180 
24C 
300 



W O <><> 420~C> 



l'( I IS')') 0326H 



185 



Thr Leu Ala Ala Asn lie Asn 31., Le u Tyr Hxs Pro Thr 0,3 Glv Glv 



155 



ISO 



Vai Val Ser Leu Ser Ala Ph 



165 



e Thr Asn Phe Ala Asn Pro Met Glu Phe 



170 



175 



He Cys Ser Ser He Gin Ala Gly Ser Arg Leu 



180 



155 



Arg Leu Gly Tyr Gin Glu Ser 



190 



Ala Glu Leu Cys Ala Gin 



1Q1 . * ^ LeU Ma Pro Va - L ^u Asp Ala lie Lys 

?nn J 



100 



205 



Phe Asn Tyr Phe Pro Phe Gly Leu Asn Val Ala Ser Thx Ala Ser Thr 

2 - D 220 

:: U 5 ?L " Uiu ^ a - r Ser Pro Arg Leu Gin Pro Pro Asn 

2j ° =35 : , 0 

^ ^ P ?hr Thr Va: ? ~ Gly He Trp Val Pro Asp Thr Pro 



245 



250 



255 



Ser Hi s Arg Asn Thr G 

^ 5 270 



26Q Z Pr ° Gly Trp Val Val Ala Pro Gly Net 



G^y Val Gin Val Glv =>-c ^h>- ~i 

.,-, 5 * w ^ ~eu Leu Thr Pro Glu 

290 23 5 

Ser Leu Ala Glu Leu Met G'v V - 5— - ^ ^ 

290 ^I; *- e ^ Pro Pro Ser Ser 

300 



-r Pro Pro 31 



~~a Tyr Asd Glu Tyr Pro 



33: 



Pro Pro Pre 



Pro Asd Val :i e 



G;y Pro Val Pro Pro Val Le-. 

3 50 



3-0 ' ->er JLy : je u Ala rhr Phe 

3 " B 38C 

535 ^ ^ ^ '" Wa Ar " 31 y Thl Mer Arg Hi 

395 



4 0C 



A. a Leu a; i a:..i p :> , v a 



WO W430-6 



re r rs'J i KU2(.s 



186 



43b 

Ser jeu Cl y W P he 01y Se f Phe , au Leu Leu Pro Sgr ^ ^ ^ 

453 460 



Gla Val Leu H is 01a s Pro Mec G1 ., Sgr Qly ^ ^ ^ ^ ^ 

4 7 S 

4/i 480 

am „ y Leu Gly ^ Met Leu Ala Met pro ne ^ ^ 

Asp Arg Arg Gly Pro Ala Lv= n e V -P T ., , 

5 0 0 7 " S va - 7al G1 '/ Ha Met Leu lie 



505 510 



Ala Ala Gly Leu Gly Thr Php Al dk q ^ i 

* Ala ?he Val Ala Ara Gin 

520 52 ~ " 

530 ~^ ^ ^ U Ald Ile M ^ G1 V Met Gly Met 

540 



v Cyj Sor Met Met Pro Leu 



550 555 



Ser G1 V Ala Ala Vdl Gin Thr Leu A 

560 

Pre His Gl-'^io Ala a— — ^ 

— - : a A., ber rrir Leu Ile Ser ^ 

sgc 31 ^ Met ^ er Leu Leu Thr Tyr 



585 59Q 



P: 



oer 



;q; T -* r Ala -/s -ys Val Ala Leu 

o 0 5 



: ei ^ ?ro Ar 5 21r. Thr ^ n Phe Ala 



-agd Pro je: Ter 



o.b 530 - 7tn ^ eu Leu Ser 



•3 J ^ 



54C 



6 70 



WO W42076 
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<Xi) SEQUENCE DESCRIPTION: SEQ 12 NC:19 5: 
Thr Pro Glu L 



?* Ser Phe Val Asp Asp Leu ^ ^ ^ ^ ^ 



10 



15 



Men val Glu He Ala Val G ln Thr u - r , 

20 ^ P LyS ^ Gly Val Lys U e 

25 30 

Pro Asp Glu Asp Leu Ala Gly Leu 
35 



40 



Arg Thr Val Gly Asp Val Val Ala 



45 



IVr Tie Gin 
50 



-ys Leu Glu Glu Glu Asn 



5S 



Pro Glu Ala Ala Gin Ala Leu 



A.r g Ala Lvs " " n i , . - _ , 

rq 7 G1 - Asn Pro Asd 



60 

Ala Ala Arg Ala Asn A r ~ 

75 



vs Val 



85 

Arg Leu Ala Cy: 



rt^a n rq Asp Al< 
90 



80 



^ ,,r^ ^ro . eu Ala 
95 



"^r Arg Arg Asp Pro Ara Gl> 



Arg Leu Pro Ala Ser Val Pro Tb>- 
"1 



u .Arg 
120 



INFORMATION FOR 3E~ 



136 . 



CHAR AC . HRI 37! C r 
LENGTH: 8 9 imi-c 
TYPE.- ammo acio 
^TRANDEENESS - 



Ala Cyo Gin Cvs H: 



■•vi r z . vr : 



Arq 



Arg Arg Pro Al » 



r. Aon A ^ a Ser 



WO W42(T(i 



PC I I S99 0326N 



188 



(iJ SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 119 ammo acids 

(B) TYVE: amino acid 
(CI STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ "ID NO:197 : 

I 4 S " ^ " U ^ ^ S « ^ ^ Ala Ala ser Thx Ala Leu Ala 

10 15 

V " " a ^ ^ AU ^ «» ^ Asp A rg Ar g Tiir Ile ^ tap 

25 30 

His Leu A! ^ v^r- ?i; . „ 

3 5 Thr G1 " - D " Glv 

4 (J 

^' 50^ ^ G ~ ^ ^ Hls Leu Ala Val Ala Pro Glr 

bi 60 

Pro Gin Glu Val val Va^ r P1 . ^ 

55 *~ hls G Val Thr Glv Thr Gly 

75 " 'BO 

Glu Val Gin Glv Glu G1-- Ar- ^ ^ 

* 3 . ' 3 ^ G1 - Val Val Asd Pro 

90 



Glu Asn Glu Il< 



Leu Arg Gin Val Leu 



^ b 110 



1G o " ly Pr ° Ma Pro H ~ £ Asp 

- Jro Asp Ala Glv I'^ 



^'FORMATION FOR SEQ - :: C • ; ; - ; 



SEQUENCE CHARACTERISTICS: 
'A) LENGTH: IIS amino lcids 
3) TYRE: am::- ., C i- 

; itra:,i:ef:7e^^ 

L TC FO T ^rz"-' ■ ' ■ ^ „ - 



a Arc; Glv Hi 



W O 99:42076 



TCT I.N99 '0J268 



189 

Glu Val Pro Gl" 

65 * """ ~~ Vet * ^ xn T ^ Ser Phe Glu Trp Leu 



^ ^ U ^ U Val Thr Asp ser Phe Glu Tr 

75 

He G1X 3e, Pro ^ Trp ^ Qlu Ser ^ ^ ^ ^ ^ 

90 * 

« *. V,! „ y M> Leu G1 „ oiu ^ ^ m> s ^ ^ 



110 



Glu Asp Phe Ser 
115 

^ FORMATION FOR SEQ ID NO: 199: 

fi) SEQUENCE CHARACTERISTICS - 

(AJ LENGTH: ail base d.-,^ 
'3) TYPE, r.ucleic acid 
C) STRANDEDNE5S : single 
(D) TOPOLOGY : linear 

UZf SE ^— DESCRIPTION: SEQ ID NO:199: 

— _A, CAATCGCTTT GGTGACAGAT GTGGATCCCG GC3TCSCTGC TKCQATGCC ' 



— .^CA^C GGGGAGT^ACA TCGAACTGCT CAAAAGGCTG 
GCGGGCG3 CCATGGATCG : 

CACCAC x 8 



60 
12C 



— — .CGCCGAC GAGCGCACGT GCACGCACTG T^C 



. bH^^uu^ vj^^GCCGGCT 

^^ jC3TTAC - gg ™gcggg TCACGACGTG GTGGGCGTCG 
iCGCTGGT GCCGG~'"*GC' ; ~~^„ , 

J — ■'^^G. TGCCAGCGGG 

"^u^GCj GGACGCCAGC GCGC" , GG'"^" ~~~-t,~~„„^^ 

- -> ^ ^ ^ .j^-v i --TGGTGTGTC~ 



— ^.GGTGv.GT 3CGGGC 



- --^ .-a^. gt 



-C tatggcggc 



2-iO 
300- 

i b ■: 
4 a 

480 



: aaga: 




WOW'420*. fCTi:.S99/[)32 ( ,K 

190 

(C) STP.ANDEDNESS : single 
!D) TOPOLOGY: linear 

(XI) SEQUENCE DESCRIPTION: 5EQ ID NO:200: 

GTCCCGCGAT 3TGGCCGAGC ATGACTTTCG 3CAACACCGG CGTAGTAGTC GAAGATATCG 

GACTTIGTGG ^CCCGGTGGC GGGATAGAGC ACCTGTCGGC GTTGGTCAGC GTCACCCGTT 

CCTCGGACGC CGAACCCATG CTTTCAACGT AGCCTGTCGG TCACACAAGT CGCGAGCGTA 

ACGTCACGGT CAAATA~CGC rTOi—n „„ 

-AA.A^Co. G iGGAATTTC GCCGTGACGT TCCGCTCGCG GACAATCAAG 

GCATACTCAC TTACATGCGA G C C ATTTGG A CGGGTTCGAT CGC— GGG CTGGTGAACG 

TGCCGGTCAA G-GTG*"* r , ;r!r' ~~-r^ 

ow.^. JWW /^CGACAT GAGG7TCCAC CAGGTGCAC3 

G G A A G G A CAA "GGACG CAT C ~GG~^r- 

""^^ ' - G ^AGGTGGTGG 

ACTACGGCGA TCTTGCCCGG GCCTACGAGT CGGGCGACGG CCAAATGGTG GCGATCACC3 
AC3ACGACAT CG"CAG"t' — — 

GCAGCCGGGA GATC3AGGTG TTGGAGTTCC 
TGGCGGCCGC C3AGGTGGAG CG3ATGATGT TCGACGGCAG CTACTTTTTG GAGCC 

CGAAGTCGTC G A A AT G G TAT CTGC^G"'"' 



uA j 



-" JVT ■ - AAGACACT CGGGGAG^^" r ~ r^-^ * 

GATC3TGGA TCG'GGTAG" , > _ 

— -AGGAAAAA TAAGAGCCGC TATCCACAA 

jCTCGGCTAC UACAAAC33T AGAAC3ATC3 AGACA7TCC G GAGCGAAG' 

"~ ~ 1 '"'^ ~ - - -^Hrt G GA AAATAG G C 



^-v-jo...... ju.juu4-.ju GTCATCGCTT 



3 . TY? - - 



60 

120 
180 
240 
30C 

36C 
42C 



48C 
540 
' 600 
56 0 
72C 

?ac 

ACTCATGGGA ^ C • 



96 0 



WO 

!'< T rSW,|)326S 

191 

ctccgccgct GCCAccGrrc cckcsttck: cgatcaacat gccgct-gccg ccAcccrrcc 

CACCCACGCC ACCGGCTCCG CCCACCCCGC CGACACCAAG CGAGC^CCG CCGCAGCCAC 
arjCaK TACGCCACC3 CACCAGCGAC CO^CTTCC TGAAACGTCG 

CG.TGCCACC ACCGCCGCCG TTACCGCCAA CCCCACCGGC AACGCCGGCG CCGCCATCCC 
CGCCGGCCCC GG CGTTGCCG CCGrTGCCGC CGTTGCCGAA CAACAACCCG CCGGCGCCGC 
C3TTGCCGC, CGCGCCGCCG GTCCCGCCGG CGCCGCCGAC GCCAAGGCCG CTGCCGCCCT 
TGCCGCCATC ACCACCCTTC CCGCCGACCA CATCGGGTTC 7GCCTCGGGG TCTGGGCTGT 
GATGCCAGCG TTC 



2ACGGATACG ACGCGCGCCA CCGGC3CCAC GG7 




CCCCCCCGTG GCGCCGTCAC 
ATCACCGAAT AGCAACCCGC 



240 
300 
360 
420 
480 
540 
500 



CGGCGCCACG ATTGCCGCGA GC t— — 

^ ^C^CCGTC GGCGCCGGAG GCGGCACTGG 

- ~_^_AAA CCGCCGCTAC CACCGGTAGA GGTGGCAGT3 GCGATGTGTA 

GGAAAGCGCC GCCTCr^r- — ^ 

- ^^CTAG CACCGCCACT GCCGGCSGCT ACACCGTCGG 

ACCCGTTGCC ACGATGACCG CCAAAGGGGC TCGCAATGTG 



84: 




ACTCCGCCGT 
GGGGTCACCA GCGGCACCGC 
GCCCGAGCG7 GCCGTGGCGG TGGCACCGTC GCCGCCGGTG CGACCGGTCG 

--^GTCGCG GGGGGTTTGA TCACCGATGC 
^ACAT~ TGCCGGGCTG 7— GS7GC 7GGCCGCGGG GCGGGGCG7; JGATTGACCC 

_^ wOIJ .^CCACGGGC GCZGC7ATGG GCGAACAGGG 



960 
102 0 

ioac 

114 0 

12 6 : 



WO 99,'4207<, 



PC I l'S99-'0326S 
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CGGCGTC=CG - — ca CCCCATATCG 

CCGCACCGTG TCCCCAGCGG M CC TOcT 

cccsca.cc ccctcgttct G ^ CGCCCC 

GGGAGGACAG GCCCAGCTTC C„ CCCTCAAGTG GGAATGCACG CTCCGCGGCG 
AGATGAATAG GCGGACGCCG TGCTGAGTCC CTCACCGACC AGTAGAGCCA 

CCTCAAGCTC TGTCCCTOTC AACOCOCCCC CGGGCGTTTC CGT.CACCGC 

CGCGTACCCG A & CGATCGATAA CGCAGTTCCT TCCCCCCAGG 
CATGG ATT TTCGAACGCT 3GCTAGC5AC GAGTTACAGC 

- INFORMATION FOR SEQ ; E XC:2C2: 

(ii SEQUENCE CHARACTERISTICS- 

! Ai *f GTK: 37 5 amino acids 
s3) amino acid 

(CI STRANDEDNESS : 
•~ ; TOPOLOGY : linear 




SEQUENCE DESCRIPTION : SEC ID KC.Z02 

?rc Ala 

— - -^a A^a . 

: o 



' ^ a AIa ^ d f« r ^ Cys A.a Thr Val 

15 

- '^r jl u Thr Ala Ala 



15 

: " 3C 



35 " ^ " SP ^ T - ^ Ala Ala Ser 

45 

:; " A. i Ala • ~ . .... .... 

- "" . ' " " : ^ 1 L 7 - i:y :. : . 



<.r? Aa:: v, t ; A:a A _ A _ 

" 5 



-^r Ala 7a; Ala A. a 7a ' ^' ^ - - 

1 ■ A^a Thr r, : ■ 

op 

95 



.-^a /v _a ---- 
^ r; _ - • / ~ ^ . ^ v Thr 



1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

136" 



WO W.'42(P6 



PC T I S99 ,03268 
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His He cys Arg Ala Val ? rc Glv Ala ru, * 

145 Gx/ ^ a A^g Gly Ala Gly Ar^ Glv 

155 

160 



He Asp Pro Vai C , s Pro Gly Glu wa Giy Ala ^ ^ 

170 1?5 

Al- Ma M ec V.a Glu Gln rro Gly Val ^ ^ ^ ^ ^ ^ ^ 

185 190 
Pro Asp Aia c /s Gi , Ala ^ ^ ^ ^ ^ 

200 205 

Val Ala Pro Gin p-o n>-^ - , „ 

^ - J Pro jal Pro ^ 

210 n _ ° ^ ^ Gly Arg Ala 

220 

Glv ^h"^ - -I., 

2 = 5 ^ '" al AIa ^ ^ Ala Asp 3-, Pre 31v A, g 

Ala Ser Ala Aid Aia 3iv Leu ^- D 

24 5 ' ^ A:a Se - Arg Aia Val Ala 

253 255 
^-r Val Ala Lvs Gl:: g 1 - y _ - , ~, , 

-60 " ly ^ ^ a Leu Pro Glv Cvs 

26j 270 

^ ^ G ^ ^ ™ -P ^ Ar 3 Aia Pr3 3 ; R , vs Aro ^ 

" 2B5 

31y Gly Arc: He ::- 5 ~_ „. 

29G " * nr Pr ° Asr. Ser Glv p he 

" J ^ 3 0 0 

Ser Ala Glv 1^ ? — -u r _ , . , 

305 " ~~ '7" TL/ ^ 5 o:n -eu :r^:; ;.vs 



i J c 

31'/ Pro Glu J.r ^ - ... - 



WO 99/42076 



PCI I S99 0326S 



194 



GGCCAAAAC3 CCCCGGCHA' 7 * ^^rr^^i- 

C^.A. ..CGo^C. GAGGCCGCCT ACGACCAGAT GTGGGCCCAG 

GAC GTGG CGG CGATGTTTGG CTACCA^CC GGCGC^CGG CGGCCGTCTC GGCGTTGACA 

ccarrcGGcc acgcgctgcc gaccgtggcg ggcggcggtg cgctgctcag cgcg^cgcg 

GCTCAGGTGA CCACGCGGGT CTCCGCMC CTGGCCTTGG CCWUSXCCO CGAGGOCAAC 
OTCCGCAACG GTAAXGTCCG GAACTTCAAT CTCGCCCGG CCAACATCGG CAACGGCAAC 
ATCGGCAGCG GCAACATCGG CAGCTCCAAC ATCGGGTTTG GCAACGTGGG 

ACCGCAGCGC TGAACAACAT — — 

— ^ ^CACCGGCA G CAACAACAT CGGGTTTG 

AACACCGGCA GCAACAACAT CC^GTTCGGC ^TAC^,. - 



60 
120 
180 
240 
300 
360 

;gc 42 0 



AAC^ AG-3TATCGGG 48 0 



— .A 3C3G...GTT GGGGTTCGGC 3GCCTGAACT -GGGCAC™ CAT 



- vJUU 



CACCGG AAACGTCGGC A7CGGCAACT CGGGTACCGG GAACTGGGG 



^^^T CGGGCAACAG CTACAACAC" ^t^^.«, 

^ -o..,^^ A^TcCGGCGA CGCCAAC. 



^_.TCCA ACTCCGGAAT AGCCAACAC" — 

— UVC. ^_G^ JW _A AC3CCSGCAA CTACAACAC 

"5 G T A G G T A C A ACCCGGGCAA CAGCAA~AC" — — — , 

— — ACATGGGCCA GTACAACAC 

jGCTACGTGA ACAGGGGGAA CTACAACAC" 3GC— 

— ^^.uow^ ACTCCGGCAA TGTCAACAC' 

3CTTCTTGT GGCGC3GC3A CCAGCAAGG; 



- TT CAA CA A 



-J 'O 'v. J ^ » 1 rt 



-j - -GGGGGC GATCGGTAA 
•-^-..,juu -AACACCGT, 



^AAACGCGGG CGTG 



^GAACAT*: 



■•■■^w^-^ ^WvCSTCGG TGACTTCAAC 

: . _ cgtcigcag:- 



0 



600 

AC 3 66Q 



720 
780 
B4 0 
900 

.TTCTTC 96 0 ^ 

GGCTTC 102 0 

1080 



>ao 



WO 9y/-l207t) 

KTrS99'0326S 
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AACCAGCAGG GCTTCAATAT TGCTAGCGGC TGGAACTCGG GCACCGGCAA CAGCGCCCTG 
TTCAATTCGG G CA C CAATAA CGTTGGCATC 1TCAACGCGG GCACCGGAAA CGTCGGCATC 
GCAAACTCSG GCACCGGGAA CTGGGGTATC GGGAACGCGG GTACGGACAA TACCGGCA7C 
CTCAATGCTG GCAGCTACAA CACGGGCATC CrCAACGCCG GCGACTTCAA CACGGGCTTC 
TACAACACGG GCAGCTACAA CACCGGCGGC TTCAACGTCG GTAACACCAA CACCGGCAAC 
TTCAACGTGG GTGACACCAA TACCGGCAGC TATAACCCGG GTGACACCAA CACCGCCTTC 
TTCAATCCCG GCAACGTCAA TACCGGCGCT TTCGACACGG GCGACTTCAA CAATGGCTTC 
TTGGTGGCGG GCGATAACCA GGGCCAGATT GCGATCGATC TCTCGGTCAC CACTCCATTC 
ATCCCCATAA AC GAG CAGAT GGTCATTGAC GTACA CAAC G TAATGACCTT CGGCGGCAAC 



:CG GCCCGGTCAA TCTCAGCGCA 



TCCACGCTGA CCG TTCCGAC GATCACCCTC 



ACCATCGGCG GACCGAC3C 



;accgtcccc atcagcattg t; 



-CCGGCT TC.CAACTC GGGCACCGGT GGCACATCTG 



oEQUEIICZ CiiARACTERZGTTCG 
-A; LEIJGTK : >i: ^u:;o 
. H ) TYPE : amir, - a c i i 
STRAIN £2^33 



1580 
1740 
1800 
I860 

1920 
1980 
2040 
2100 
2160 
2220 
2280 



GCTCT GGAGAGCCGC 2 34 0 

"ATTACH TCCTC AAGAT GGA7CCGGCG CC3GGCATCG GAAATTCGAC CACCAACCCG 2400 



2460 

2520 
2580 
2640 

2 7 0 C 
2 76 0 
. 82 : 



WO W42076 



PC r/l ; S99;(M26S 
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Met Trp Ala Gin Asp Val Ala Ala Me, Phe Glv ^ Ala — ^ 

2 0 - - ° A 13 Axa 

** S 30 



Ser Ala Ala Val Ser Ala Leu Thr P^ 0 Phe -] y s: r 

*• u vly oln Ala Leu Pro T^- 

* u 45 

Va: Ala 01y 01y Cly ^ val Ma Ma ^ ^ 

5D 60 
Thr Arg Val Phe ^ Lgu G] y ^ ^ ^ ^ ^ ^ ^ ^ 

' ^ 80 
^ ^ ^ ^ ^ Asn Leu Gly Ser Ala Asn ^ 

Gly Asn Gly Asn He Glv ^ _ _ _ 

ioo " v:: ^ : ^ z * er ^ : ^ 

no 

^ ^ '' a: Pr ° - eu Thr ^ Ala Asn Asn 

120 l25 — - 

Gly Asa Thr Glv As- A-n - - 

13 0 * J A - e ^ y ?rie Gi y Thr Gl 



j:j 140 
Asn Asn He Gly ?he Glv As- _ • 

145 , " Arg Gly H e Glv 

" = u 15 5 . , ' 

*b0 



Ser Glv Leu :.eu Glv Phe Gly 31, • :,, u Ser 



Lei: Th- 

- - - *■ u.y ^ rf u ^ e U D « - , , ~ i . . - 

Gl 
170 

Gly Asn lie Gly Leu ?he -\sn ,er -w- . , , 

lao — J -' ^ n /a - Gi V ^ 3iy 

-95 190 

' ^ ■ ^ ~ .-^sn c^r j-y Asn oer ^- 

1:33 j OS 

Asn Thr G 1 v uhn "m,, - 

111 P ;1 * a rtSn Thr G1 '/ Phe Phe Asn 



?he A:,;: yet ; 



-eu As;: S^r Gly Asn Tvr Agh Thr 

270 



WO 99 42076 



PC I /IJS99 03268 
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305 



310 



315 



Asn Ser Gly Ala Gly Ser Ala Ser Gly p 



32 = 



Phe Leu Asn S 
330 



320 



er Gly Ala Asn 



335 



Asn Ser G l y PHe Phe ^ ser ser s „ Gly Ala ^ s 



345 



350 



er Gly 



Leu Ala Asn Ala 
355 



Gly Val Leu Val Ser Gly Val H e Asn Ser Gly 



360 



Asn 



365 



hr yal Ser Gly Leu P he Asn Met Ser Leu Val Ala ^ 



370 



375 



Ala lie Thr Thr ?ro 



380 



£ zeu Iie ser 3iy fl 1 : phe - 01 7 Ser ^ Mec ser slv 



Phe 
400 



?ne v^ly Gly ? ro ?r3 Val p , ^ 

, -l/ -eu Ala Asr. Ar^ 3 lv Va 1 

410 ^ .,; 

Val Asn Hp Le^ 1 n 1 " ^« 

- ^ c... _ n ^ Asn Giy ^ ^ As:i Leu Gly 



42; 



430 



3er u ^ ^ Va - 31 v Ago ?he A.r -p r r 

435 " ■ ^ U Uiy Se ~ Sly Asn Leu Glv 

4sC 445 

Gin Asn i:^ i^u G.v S- 
450 

Jly Asn He G 



3^y Asn 



4fo 



455 

'"a.1 ?he Asn V 



Val Gly Ser Phe Asn He Gl } 
46 0 



d ~ ^ S * r '^V Ser Leu Gly Asn 
" 5 480 



'"vr Asn 



4 9C 



495 



Asn Va. Glv Asp 7yr * 5n /ai 

500 



V A.sr: Ala Gly Asp Phe Asn 



As:: 



rrn As 



56C 



WO 99/42076 
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l , CT/l.'S99/03268 



5iy He Leu Asa Ala Gly Asp Phe Asn Thr Glv Phe Tvr Asn Th^ Glv 
bl ° ol5 



620 



Ser Tyr Asa Thr Gly Gly Phe Asn Val Glv Asn Thr Asn Thr Gly Asn 

630 635 640 

Phe Asn val Gly Asp Thr A S n Thr Gly Ser Tyr Asn Pro Gly A SD Th- 
"5 650 ■ 



655 



Asn Thr Gly Phe Phe Asn Pro Gly Asa Val Asa Thr Gly Ala Phe Asp 

6 6 0 



665 



670 



Ly Asp Phe Asn Asn Gly Phe Leu 



Val Ala Gly Asp Asn Gin Gl\ 



675 630 6a5 

-in -le A. a II- Asp Leu Ser Val Thr Thr Pro Phe :i» p rc Aq „ 

700 "' ^ " 

Glu Gin Me, Val He Asp Val H,s Asn Val Met Thr Phe Gly Glv Asn 
?1J 7i5 720 

Met He Thr Val Thr Glu Ala Ser Thr Val Phe Pro Gin Thr Phe Tyr 
/25 7 ^0 735 

Leu Ser Glv Leu Phe ^h^ d>-^ , 7 ^i - 

" ne J */ Pro Val Asn Leu Ser Ala Ser Thr 

•'4 0 - T * r 

J 45 7S0 

*• 

Leu Thr Val Pro Th- r-,. T . r 

**- I-e G-y Gly Pro Thr Val Thr 

' bD 7 ^ 765 

- ' - ' a - »eu J.u Ser Arg Thr He Thr Phe 

"80 

_ys lie Asp Pro Ala Pre V v ^ ^ - 

78S • ' ■ - — Thr ^ 

-° 3C0 

»r Glv Phe Gl- 




WO 99/42076 



rCT/i;S99/03268 
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Ala oly Leu A^a Asn Leu GI" n - t 

u-l, bin Leu Asn He Glv Se- Al^ - 
9 0c Ala Ser Cys 

~, «. £ .x. , su Mp ^ vtl ,„ Thr ;l . Us ^ 

925 

Cys Gly ser Ala ser Asp Glu Ser ^ Pro Gly Ser v a l Ser Glu 

(2) INFORMATION FOR SEQ TD NO:205: 

(l) SEQUENCE CHARACTERISTICS - 

(A) LENGTH: 53 base oairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

XZ ' ^ EQUENCZ ASCRIPTION: SEQ ID NO: 2 05: 
3GATCCATAT GGG COAT CAT CATCATCATC ACGTCATCCA CAT CATCGGG ACC 
C2' INFORMATION FOR SEQ ID NO:206: 

■ ( i ) SEQUENCE CHARACTERISTICS • 
:a; LENGTH: 42 base'oa^ 
■:3J TYPE: nucleic acid 
~: 5 TRANCED NESS . single 
TOPOLOGY: linear 

3EC:JENC " ASCRIPTION: 3 EC ID NC -06 : 



EQUENCE CHARACTERISTICS. 
■A, LENGTH: oase 
3i 7T?E: nucleic acid 

~ TTANTFDNES^ ; - . 

■ -inear 




WO <)4/.|207(> 



rC'T/l.'S^ '0326S 



zoo 



rCTCAATTC AGCCCTGGAA ATCGTCCCGA T 

:) INFORMATION FOR SEQ ID NO:20 9- 

(i) SEQUENCE CHARACTERISTICS - 
(AJ LENGTH: 33 base oairs 
fB) TYPE: nucleic acid 
(C) STRANDEDNESS : single 
(E>) TOPOLOGY: linear 

(xii SEQUENCE DESCRIPTION: SEQ ID NO:209: 
lTCCAGCG CTGAGATGAA GACCGATGCC gct 
INFORMATION FOE SEQ ID NO :21Q : 

SEQUENCE CHARACTERISTICS ■ 
■AJ ZZmzy.; 33 base 
;3J TYPE: nucleic acid 
(Ci STRAND EDNESS : single 
(D) TOPOLOGY: linear 

{X: ' SE QUHNCE DESCRIPTION: SEQ ID NO: 210: 
IATCTGC AGAATTCAGG TTTAAAGC~~ ~„ 
INFORMATION FCR SEQ ID NC : 2 1 I 

SEQUENCE CXSuh\crZ2Z37ZC?. 
''AJ LENGTH: 30 base oa" — s 
3) TYPE: nucleic acid 
: C ; S TRANDedi JE S3: 3 1 na I e 
~ ---^ i ■ - mear 

xi SEQUENCE DESCRIPTION SE^ — 

ATCCGA GCCACGTGC- CACAACGGCC 



31 



3 3 



38 



."-^vrr. .m. ^. J ~ ~' T 

STRANDEDNESS : Gincrl- 



APOLOGY 



--near 



WO9«W207(, I.CT.r.SW(»320S 
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(C) STRANDEDNESS . svng i e 
(3) TOPOLOGY: linear 

SE 0DEN CS DESCRIPTION SEQ ID NO;213: 



AGGTAGTGGG CrST o/, ^«'^ ^ « ~. 
TAATAGT GGAGTCTTG 
rGATTTA TAAGGGA TT 



-AATAGT GGAC— GT TCCAAA™ . 

--- .^^wil—j AACCGTA1 



' TGGTTACGCG 


60 


TCTTCCCTTC 


120 


TCCCTTTAGG 


130 


GTGATGGTTG 


240 


AGTCCACGTT 


J 00 


CGGTCTATTC 


360 



ACAAAAATTT AACGCSAATT ~AA~\A — t - — 

ACAATTTCAG GTGGCAC— 
TC3GGGAAAT GTGCG CGGAA CCCCTATTTG mf Tn^ Ta ,, T 

lT ' Tv - TAAATACATT CAAATATGTA 
-CGGCTCATG AATTAATT CT TAGAAAAAC r-vr„ 

CAAATGAAAC TGCAATTTAT 

* — ^^"ATCAGG ATTA^CAA^h ~— ' ,„ 

** — gaaaaagcgg — „„„ 

gaaggagaaa 

^ctcacggag gcagttcca™ ;r-^— 

^ * rt! J^n . GGG AA ^^^^^^^ ^ 

- ^rx^ . CGGTCTG GG ATT G G G A CT G 



AGTGACGAGT GAATGGGGTG - C -V- 



- ^ W iAA A T ~ A C T 'J 'J , ,-, ^ „ _ 



-vrvLAAl 



^GTGTGACGA 
'*^A"TGTGGCG 



480 
540 

500 
5 6 0 
720 



ATC AATACAA G GT ATTAA—— — — _ . . 

— TAAGGTTA TCAAGTGAGA ?80 



34 0 



^ ^ ^ - — Ji GATTGG G ^ " i " ^ " - 

^ wrtuu,n -AGGAAATAC 3GGA'-^ — - 

^-^^^ tt.waggac 960 

-wViAC AGGAATGGAA TGGAACGGGG - : - f v--~. 



WO 99/42076 PC I7i;s99/0326S 

202 

gatccttt 



r-CC 3 C=T AAT~=_. 0 c TTGCAAACAA AAAAACCACC G~ACCAG~ 

™™ .c ct= a aG „ ccuktmc 

AGAGCGCAGA TACCAAATAC 

AACTCTGTAG CACCGCCTAC ATACCTCCf- r-r™, 

ATACCTCGCT CTGCTAATCC TGTTACCAGT GGCTGCTGCC 

AGTGGCGATA AGTCGTGTCT TACCGGG7TG GACCAArar r» 

^ GACT^UVGAC GATAGTTACC GGATAAGGCG 

CAGCGGTCGG GC'GAAr" 1 ".-" ~~ „ 

-^AC_ _ TXCCTCC ACACAGCC:CA Gcrr5GAGC3 

A C G GAACTGA GATACC^ACA ^, „ 

-AGAAAGCG CCACGCTCC CGAAGGGAGA , ,„ 

AAGGCGGACA GGTATCCGG"" ' " 

> GTCGGAACAG GAGAGCGCAC GAGGGAOCTT 



1560 
1620 
1680 
1740 
1800 
1860 



--GA^TTGAG 



_ " TC -"" VGGGGGG ~ GGAG CCTAT GGAAAAACGC CAGCAACGCG 

— - C_C~C CTTTTOCTGS CCTTTTCCTC ACATGTTCTT TCCTGCGTTA 

--CCCTGA7T CTGTGGATAA r-3TAT»AC - * 

- u — -GAGT GAGCTGATAC CGCTCGCCGC 

AGCGGAAC3A GGGAGGG"-^ — ^ 

— A^AGGAAG CGGAAGAGCG CCTGATGCGG 



—-A.. TGACACGGCA TATA7GGTGC ACTCTCAGTA 

1 ' rti ' J "" vjCA TAGTTAAGGG AG7A~ACA~ — 

^ ta uv.. rt *LoC TACGTGACTG 

A 00 ^ ~ 7l_ — oAC AGG^G"" 1 * " ^ 

JrtL — G rt C GGGCTTGTGT 

GG ^GGAGGTGGA 7GTGTGAGAG 

' *v.__, _ -.-.TGAGGGA AACGCGrGAG :G^G~~ ~ 

* — ' — vm^.GAT GAGCGTGGTG 

jTGAAGGG/aT TGACAGATG'" 



* 36 u 
2Q4C 
2100 
21GQ 
2220 
2280 
234C 
2 4 00 
2 4 b l 1 
2 q 2* 
259 0 



2ACTGGAACG 



-w.'%_^GATGT AGGTGT7GG" — - 



WO W/AZO'U 



202 



l'('T,'l'S«WI)32(>N 



CATGCCGGCG ATAATGGCC7 3C "CSC* -sv, 

--~c^ ^crrra gtggcgggac cagttucgaa 

"^ TOCa ™A —G CAAGC 

^■ : " GCGA AACC0CTCK sacccagagc gctgccggca cc^tc-ac 

OWTTOC*™ ATAAAGAAGA CAGTCATAAG XOCGGCGACG ATAGTCATGC CCCGCGCGCA 

CCGGAAGGAG CTGACTGGGT TGAAGG,— 

^AACkKrr_ uAAGGGCATC GG TCGAGATC CCGGTGCCTA 

ATGAGTGAGC TAACTTACAT 7AA-~T^ — 

^ iAATTGC — -CGCTCACTG CCCGC— CC AGTCGGGAAA 

Cw * jaCSTGC GAGCTGCAT7 AA7GAATCGG —An- 

— -w*ACv^_^ jGGAGAGGGG gttttjcgtat 

-*-*^Cw\ GTGAGACGGG ^in^.-r. 

--gcgtggc" ~tgagagac7 tscagcaagc , 

\cgc7 :-g. _ agcaggcgaa 
AATc.Grrr oatggtggtt aacggcggga tataaca — 

—AA^.un — -GTG^TCG GTATCGTGGT 

ATCCCACTA7 CGAGATA~— ~ r~~- - — - ~ 

— G-__ GGAGCCCGGA CT CG G T AATG GCGCGCATTG 

-GCCCAGC3C CAT-3ATCG TTGGCAACGA 3CAT~CA~ — ^ 

^^xv.^,^ 0x ^urGAACGATG CGGTCAT^CA 

^---^.^ ArtACGGGAC\ TGGCACTCCA — ■ ~*v~ 



3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 



- --j^ .vj^v^. - GCGA G7GAGATAT" - ^„ 

-^-oCw*C^_ AGGGAGACGC AG AC 

^■^A 1 - AG^tr^i — » A-rv ^ GGGG7Z GG""AACAG . rii[ ^ 

— t.T jvjGAGAAAAT AATAC" 

jGT GAGAGAC AT GAAGAAA 7 



37gc 
3340 
3900 
^A 3 960 

4020 

GCGACGAGA7 ^ 0 8 0 




-'■vj anA7 j AA ^7 ;;^ ™- 
j ^ -rGGTGGCC "7j"" T, -y 



-j^-^tvjm^^ - -TGA7AA GAGACACC"_ 



t 2 8 C 



4 56C 



v - - - - ^ nA. .3AC 



204 

r~— ax * rcara -~ 

- GAGC " ^t-t*,-.^ 

TO _C GATGCGTCC3 C C„ 
™ TJ M=caiflTM Qu(TOcct 

TTTTGTTTAA CTTTAAGAAG GAGATATACA 

<^TACA fAxGGGCCAT CATCATCATC ATCACGTGAT 
CGACATCATC GGGACCAGC" mr*~r.n~ 

^AACAGGCG GCGGCGGAGG CGGTCCAGCG 
GGCGCGGGAT AGCGTCGATG ACATrr— - r. 

ACA.CC^. CGCTCGGGTC ATTGAGCAGG ACATGGCCGT 
GGACAGCG CC OGCAAGATCA CCTACCGCAT CAAGC— ^ 

GTGTCGTTCA AGATGAGGCC 

-^caac:g aggggctgga aaccaccgag „ 

^^^CGCC ggtgacgttg gcggagaccg gtagcacgct 540 

j ^GTTCAACC TGTGGGGTC" ~r-o^ 

" " C GAGAGGTATC ^AACGTCAC 54S 

C^G^GCACCG GT^^— -r— _ 

-^C3 CAGGCGGCCG CCGGGACGGT 552( 

LTCrGTr " G ^ GGTGAT ^GGGGC ACAAGGGGC^ 35go 

— ^_A7C GCGCTAGCGA rc— — ... 

--AGGTCAAG ~A~:i-r— - ~ 

— ^A.C_ CCGGAGTGAG 5640 



4860 
4920 
4980 
5040 
5100 
5160 
5220 
q 2 90 





-3AGCAGGTC AAGCTGAACG ; 
-TGGGACGAt: 

-ggcggcctg aacgtggggg gcaccgcggt 



: GGGGGATG TACGAGGGCA ~CA7CAAAAC 



GG 'jTGAGAACGG gaagggc— ^ 



570C 
bd2 2 



-sgtgccg ggtgcgc ^ ' ^ *^" C 5dHC 



G3G~ -GGGTCGG- A*"' _ ' ^ — CCGCC3A 59 4C 

-—AACGGGG 

'"'jGlg jCT'j — - „ 
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G -C_ C -<^GGTGGG G— c^Vr^ ~~ 

* ^ — GCCAATA AGCAGAAGCA 

oGAACTCGAC GAGATCTCGA C— 

GGAGGAGCAG CAGCAGGCSC TGTCC-GCA ^ 

" OTGCCCACAA CGGCCGcc^ 

GCCGCCGTCG ACCGCTGCAG CGCCACCCG^ ^ 

^rn^^ JrTGCCC C -CCACCACC 

^GC^^C^GCC AACAGGGGGA ATrr— 

^ ATGC --AGCC GGGCGATCCC AA-PArr,, _ 

AA-^CAGCAC CTCCGCCGGC 

— A^GGAAG GCAGGGGGGG CAG— - - 

. ^CCCCAAAC GCACCCCAAC CTGTC— A- 
CGACAAGGGG TC ^ Ai ^4 0 

^ — TCAGCTTCGC GCTGCCTGC- G~r- ^ 

^"-^ G ^C.GwGTGG AGTCTGACG" 

-ggggacttg gagtacggt- 

— GAGCAAAACG AG"-"-'"---- 

-ggacagcgg — _ ' 

— LCAATGACAC -GGTA7GGTG GTCGGGGGGG ^ G ^'* 



"AC3CG AGr^~~~-* * ^ 

CAAGGGC 



GGGTGAGTTC TATATG'"'^ 

Uoun.CAAC CAGGAAACCG TCTCG~T 

- ^ - _ ^xAGGG " 



^j^GCGT GAGGGCTGGT 



-^rt^j-^^ GGTGG "^'GAA'"'^'^ " ~ 

- w ° ,,j -- - - *g gtggcggggt ~— ~ 

jGCAGGGGGT — ~ , 



■■■x-H:r;cH: gharai-t^o — 

^ * *'?E : ammo ;icid 
s trand edited o 



6540 
6600 
6660 
5720 
6780 



6900 



~02„ 
7080 
714 0 



— o i G7GTGGAA GGGGG7CGTA — ^r~~^~„ 

-~CmAG.C AAGTTCAGGG A^-ar-v 
- JrtrtCGu ^ GAGATGTGGA CGGGCGTA " - — 

~— ^ol_ ^GGGCGAACG CACGGGACGG 

jTGGTATG ggtcgggacc gggaacaagg cgg— - 



'260 

'32 0 
380 



-GAGC ACCAGGAC^' - — - 
-GGAAAGT" ' - — , _ — — ^ATCGGGCTG GT AA G AAA G G 
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106 



25 



30 



Asp ber val Asp Asd He Ara ^ 

K fc ^ g A^a Arg Val Hp r^, -i 

35 40 y 1 Iie GiU ^ Asp Met 

4 5 

«■ »« - P „ r u . Gly ^ Ile ihr ^ ^ ^ ^ ^ ^ ^ 
« Ph ' net "*» ,? 01 " p " s " •» - 

*' Jf " 7 «v «. =a y 1 val „. TSr ^ ^ 

Ala Ser Ser P-o t/, ^ ^, ^ „ 

y* - -hr ,eu Ala Gl, Thr Giy Ser ^ ^ ^ ^ 

"" 5 110 



Pro Leu ?he Asn 
115 



-eu Trp -jiv r>_ 



- n h^ His 31 u Ar: 



Val Thr He Thr Ala 



130 



^-i~n Glv Thr 
13 5 



Ser Gly Ala Gly He Ala Gla 
140 

Ala Ala Ala G T - » 

14d J — Aia ier ASD Aj 



150 



jIu Gly Asp Me- ,\1 



lo 5 



15 : 

Leu Met Asn 
170 



p dia Tyr Leu Se* 

16 0 



:le A1 <3 leu AH 



Ser Ale 



iac 



v al Asn Tvr 



7*? Leu Pro Glv Val Ser Glu Hi, 

^ 190 



jcu Asn Gl- 
195 



•^5C .Ayr: ?r 



-VS 7a. Leu Ha A. 

- Ala Ala Leu Asn Pro Gl- 



a Xgz Tyr 31- civ Thr T 1 - 

20^ 



j ... n 
215 



220 



a^ As r: 



2he 31 



Thr val Asp ?he 



Ala Val 



:?o 



^* Asn G ; y Jl" Mec Va' 

2 3 : ~ 
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Cly Asn Ph . - eu Leu pro 

325 — Gin Ala Ala Ala Ala 

Giy Phe Ala Ser Lys T h- p~, ^ 

34Q *, Ala £ Gla ^ a ne Mfit ne ^ 

G ^7 Pro Ala Pro 



Pro Asp Glv T y r P^n n 
355 * 3^0 " Glu Tyr Ala II 

^ ^ G ^ ^ Asp Ala Al 



365 



370 

?he Leu 
385 



Ala ^ei 



a Thr Ala Gin Thr Leu Gin Ala 

380 



' r? Al a He T ^ >- 
390 



A^p Gly As 



11 ^ a ^ r ^ 

-300 



^ ?r ° Le ' J p ^ ?ro Ala v d ; 



405 
Ala Thr 



41! 



L ^u 3er Aed 



° er Sc>r Ala GIu Met- r, ■_.-<=: 
425 



Thr Asp Ala Ala 
430 

-e'- A^a Gin Gla a* ^ 'M,, - 

43 ^ ^ ^? -le Ser Glv - 

440 y Asp ^eu 

445 

- yS ~ hr --e A* n - a : „ 

^ " - J - ^ ^ Gly Ser Leu Gl, 3 .. 

460 



45i 



m-l a Gl - 



Ala Glr. 



Ala val Val Ara 
480 



Ji - /\j.a .i^a ^ S r, - . ,„ ^, 

" *"* ° -vs G-- - 3 , 

48 5 ' h^d GIu 

4 9C 



49^ 



500 



■^a 'e 1 



/al Tvr j*- . 

505 ' " * 

510 



j 1" Mor . . DV 



i Ala A. a 



As ? ?r: a,,- 



5 6 5 



5^ 



5 r> C 
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tep Ala Ala His ?h» Asp Tvr G] v - ^ 

sS ' ^ ^ U LeU S « ^ Thr Thr 

620 

Gly Asp Pro Pro Phe P-n ^ 

^25 G ^ Gln ?ro Pro Pro val Ala \<-t- a 

630 a Asn Asp >- 

- «« u. „, ^ L . u „ „„ ^ ^ ^ Mj ^ ^ J° 
"* ** °* P E «" ~ "* - - «r - «, « I 

670 " ^ 

Phe Tyr Met Pro ^ ?ro 

fi7 5 • / £ ^3 Asn Gin G lu Thr Val S e^ 

685 

ieu AJ * asr a:v val -iv - 

n9b ^ "-*al Lys 

700 

Ser Asa « - -, 

" p ° er -vs ?rn :.c-« 



.c ;: e Trp Thr -iy vai :i, 

: ^ As- Ala Pro Asi 



720 

cro Ala Al - 

— Ala Gly Pro Pro Gin Arg T - 

■73C f 

' /a - Val Tro Leu ^ 

740 ~ ^ a ^ Asn Val Asp L ys Gly Ala 

750 



3 leu Val Ala Pro ? ro ?rc 



— i ~ys Ala Leu Al ^ ^ 

75 5 J - -u. - r3 _ eu Vrf , 

760 

765 

A^a ?r- Ala ? ro Alrf :> r ~ v , _ 

J ■ ?ro ;^a Pre 

^ao 

Val Ala „. 



30 0 



I'C'r'I'S'W (13268 



209 



TCTTCCACCC CATSGTT^ £Sr~~ GGGTCiAACCA CCAGGCCGTA CC-attr-r- 
GTGCCGCCAT c ^-- S C TO GCTGCGTAAT £££££ 

TOR SEQ I DNO:2l6; 

( i > SEQUENCE CHARACTERISTICS - 
A LENGTH : 470 base pa lrs ' 
B- 1 TYPE., nucleic acia 
(CJ STRANDEDNESS: single 
(0) TOPOLOGY: Imear 

«°^C-JL S TYPE: Genome D NA 
(X1) SECDENCS -SCRXPTZCV. SEO I D NO:2i6: 

Sis ii5S SSS £S£ ««« 

AGCTGGACAT GCTGC^ACC ^P" 50 " 3 ™ GCGCCG CGGCC^-- 

::^ G f G :^~~ :rl T ^ t^™^ 

S^ CGC ; SSS gtgatcacca 

— ~ 2225 

IOTOR ^-ON TOR SEQ ID NO:- 7: 
; A ^ngt- 279 base oa:rs 

* -^fOLCGV . l;near 

z TV? e ; .lencr^;: DMA 

' '*« ■ ^3 4 j NO : Jl- ; 



-CGAT~GA7 ATGGATGAGA 
;^ C '~~'^GGA -AGCTr-AAG-J V— 3c---— ' ™" TTG3A " ' 

-;agc : ^?^:: C : jag "-"a:', .\aot"tt-^ - — 



- : """.^:;c:; pop se^ 



;jc : 2 : a : 



360 
•12 0 
45-4 



iao 

240 
300 
360 
420 
470 
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^ C ^I C ^ ACTCGACGAG CGZCZCGTGG AGGTGTCGAC CGACAAGCTC GACACCGAAA 
TCCCTCGCCG GCCGCGGGTG TGCTGACCAA GATCATCGC7 CAAGAAGATG ACACGGTCGA 



GGTCGGCGGC GAGCTCTCTG TCATTGGCGA CGCCCATGAT GCCGGCGAGG CCGCGCTCCC 
GGCACCCCAG AAAGTCTCTG CCGGCCCAAC CCGAATCCA 

{2} INFORMATION FOR 5EQ ID NO: 219: 

(ij SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 342 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDED NESS : single 
(DJ TOPOLOGY: linear 

(ti) MOLECULE TYPE : Genomic DNA 

ixi- SEQUENCE DESCRIPTION: SEQ ID NO: 219: 



-; SEQUENCE CHARACTERISTICS: 
■A; LENGTH: 515 bas~ pairs 
(3) TYPE: nucleic acid 

7' STRANDED NESS . Ginql- 
:D) TOPOLOGY: linear 

.i! MOLECULE TYPE. SencT.ic DNA 

:i. SEQUENCE CEGCRI?TIC:: 5~r. - 



. — G^t/-vT _ G 



^ :y-^3c: GACAT7GG7G 77GCGCCCG7 77C7AAGCC7 GCAC77AAG7 

;G I G7GGC7GAG 77777A777T ACGTGACGC7 S7TGGTGCGA AAGCTGGCGT 

j-.j^^.jl^ j;u k ...Vw'-s AGAAGCGGG7 7AAAG7AC7G 7777~~~7A; 

_ ^ --AGAAG77GS 7TT7GACCT7 7T7G7GGGT~ 

- v. ■ . . -Avjvj.T. vS /j 7 GG 7 7 GAAA T 7 'IT" 7 7 AG T*T 



.■1 .. - ._ j 



60 
120 
180 
219 









77AAGCCCGC 


ACCCAAGCCC 


'^j i ^ CCGAGC 


D J 


777GAAGGC7 






CATCGCCGCC 


GGCGGCCCAG 


CCAGCCGGTG 


120 


CGCACCGTAC 


GTGAC 


GCCGC 


TGGTGCGAAA 


GCTGGCGTCG 


GAAAACAACA 


180 


CGGGGTGACC 


GG CAC 


2GGAG 


TGGGTGGTCG 


CATCCGCAAA 


CAGGATGTGC 


240 


TGAACAAAAG 


AAG CG 


SGCGA 


AAGCACCGGC 


GCCGGCCGCC 


CAGGCCGCCG 


300 


CCCGAAAGCG 




i. GAAG 


ATCCGATGCC 


GC 




342 


) INFORMATI 


CN FO 


R SEQ 


ID NO : 2 2 0 : 









INFCRMAT 



wo yy.'42076 



per rswo326s 



211 



2} :nf3rkation for 



oc^ :r ::c:32i 



■:r-QU^."J_ T KARA 3TERI3TI 33 
oase 1 1 :-; 
r.uc^ e i r. acid 
--ANE: ERNES 3 . :;r.g;e 



60 
120 
180 
240 
300 
360 
420 
480 



ixi) SEQUENCE DESCRIPTION : SEg ZD NO:221. 

GGTGCAGATT ^AACAGGCCA TTGAGGAAGC ACAGCGCACC CACCAAGCGC 
G ™ TCGG ™ ACCAGCGTCA ATTGGAGATG 
^SSSc Sc™^ J«CTTCAGG TCAATGTGCG CCAAGCCCTG ACGCTGGCCG 
AC^GGCCAC CGC^CGGA GACGCTGCCA AGGCCACCGA ATACAACAAC GCCGC-rznr 

£££££ ™S gg 5 ™ ccgagc agagcgtcga agac "^ -^'-1 

ACUU5GCGCT TAGCGC^GUV.GCTCAGGCCA AGAAGGCCGT CGAACGAAAT SCGATCf-rv 

ssssj s G T c CG s 3AG sr*** tgctcagcca gctcga ^ g 

cSSS ™r TC3A TGAGTGAGCT CGCCGCGCCA GGCAACACGC 

SSg? GACAAGATCG ^cctcgcta CGCCAACOCG ATCGGTTCGG 540 

557 

(2) INFORMATION FOR SEQ ID NO : 2 23- 

- - ) o ECIITNCR CKAicACRE ?. i S T I CS : 
■.A) RIRNGTH : 32 3 base oairs 
i3) TYPE: -c;e:c acid 
fC) 3TRAN0EDNESS : single 
(O; TOPOLOGY: linear 

;::! MOLECULE TYPE: Genomic ONA 

,x:! SEQICHirCE DESCRIPTION: SEC ID NO ■ 222 ■ 

-AGGATAGGT TTCGACAT3C AC""~'V'""* r ' — — - ^^.^^^^ 

AGAGGTGGAC — ~-7~™~I~ I"' — -^CCGT GTGATAGGCC 6 0 

^ „_ r _ -3 JAGGACTC AACAGAAATG 033™^^^ --> 0 

;::^;r:r G ?r;::::? T gagagc — ccgaggggac cgttacccgc tggctcaaIc He 

.^,^G^^ - rik -^ * CGAA CTCGACGAGR 333TG3TC 



223 



COPCLCGY: linear 
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(-} SEQUENCE CHARACTERISTICS- 
(A) LENGTH: 484 base pal-s ' 
(3) TYPE: nucleic acid 
{C) STRANDEDNESS : single 
(DJ TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DMA 

(XX) SEQUENCE DESCRIPTION : SEQ ID NO:224; 



TCZTGGCC^r ai^r— ~~~~~ 



-CGCC3GCC3 ACGAT3CGGG 



- -> i SGTGGQC GTACC^" 



(2) * N FORMAT I ON FOR SEQ ID N 0:225: 

U) SEQUENCE CHARACTER ^^-CS- 
fA) LENGTH: 53^ basp 
J3J "^PE: nucleic acid 
(C) STRA;roEDNES5 : s—gl- 
(2! TCPCLOGY: linear 

(n, 1 MCLECULZ -vpp. 

;x: ' sequence jesc^-^^m. . ^ 

. in g ^ j ,tq ; 225 ; 

"GGCA CCGCCAGCCA — 



CAGTTGACCG 


60 


CGAGCAG AG C 


120 


GACGCGGAGA 


iao 


CCAAAGGTGG 


240 


r'^^^^,^, _ 


300 


CCGCCTAACC 


360 


ATACo-GGGGA 


4 2 0 


GGGGGAACGC 


480 




484 



:acgactgto "sccgcagcc tgcaac^— -~ agc at_c ggccttgacg n 

AACGACATGG GCCAGGTCCA :GG™~~ ~T„^^^ ~ GGCGT ACAG CACCGGCCGC 
GTGAGGCCGG CGTGGAAGTG gIac~~^~ - CCGAAGGGT GGTAGTTGCC TGCGGAATTC ~s 
~CGGGCAGCG GAACAACCGA ^TTGCGAC^^ £~ CCCAGAGCGA GCGCACGGCG 
2-CGGTCTCGG ACGCGAACCA "GAG CGT\^ — ^IT^f CGACCGCATG CCGATGGATC 
TCCAGCGCAT ACCCGCTGGG AAG-" V^~-~r ^.--iT^"' AGAGCGCGAA "GGGATGAAG 



-^jm.^ vGr_ 

AAC 



-ACGTC32GC — J "',1 ' * " T — CCAGCCA CGGTCTTTC: 



-GGAC3A ~TT~ ^ ' ^ ' v ' - J - — ■ ^- - nAAGAAC 



RMATTGN • _ : e; 



sequengh chapa-te 



IC 1 d 

NE.G5 , 



4^ 
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Gly Gly Ala Ala Ala Gly GIr. Gin Ser Asp 

~eu He Ala Val Asp Asp Glu Arg Ala gL 

20 25 
Arg Gin Glu Ala Val Gin Gin Arg Glv Pro 

35 40 ' 

Pro Val Ala Asp Gin Gin His Pro Gly Asp 

Pro Ala Asp Pro Pro His l! p Pro uis His 

55 70 

Thr Gin Gin Gly Ala Gly Glu Pro P ro Ala 
Asp ._y Leu Pro Asp Arg Asp Gin Leu Leu 

Gla ^ a Vai Pro Glv Val Val Phe His 
115 120 

130 135 

H INFORMATION FOR 3EQ ID NO: 227: 

£ I ) SEQUENCE CHARACTER I ST I CS • 

(A) LENGTH: 156 amino acids 

(B) TYPE: ammo acid 

<C) STRANDEDNESS : sir.glo 
(D) TOPOLOGY: linear 



Val His 

Arg Arg 

Arg Gly 

Gly Ala 
60 

Gin Arg 
75 

Glu Ser 



Pro Phe 

Asp Asp 
30 

Asp Glu 
45 

Asp Gin 



Ala Asn 
15 

Glu Glu 
Ala Asp 
Cys Arg 



His Gin 
Val Val 



Thr Asp 
Pro Met 



Arg Arg 
110 
Val Val 
125 



Asp His 
80 

Thr Glu 
95 

Val Asn 
Gin His 



SEQUENCE CES; 

:vG ^Vr -•:-/ Gly Ser 

Glu Arg 1 1 \*a 1 

• r a_ 7a 1 Ser Al a Met 
35 



SIPHON: sec 



:d MO : 2 2 7 



• a^ Ala Asn 



• v As; 



Gin Gin Val cys ?r = Ala 



Ua Glu Arq He Arg Arg 

Gly Asn Asp Val Val 
3 0 

Asp Asp .eu Leu Asp Leu 
45 

Arg j'lu L-eu Asp Met: Leu 



- :: ^-^r M eu Gi v 

■ J ■ H- Thr Thr 

ICQ 

Hy Arg Leu 
115 

^ 31 y Phe GIr. 



Val Ala Mo:: 



1Q C 



Al a lie 



9 5 

->' ;i - - He Asp Val Thr 
110 

^ r 'i /a - Val Leu Val 

::s 

Ivs Asp Val Thr Tnr Leu 



A* a '/a I Ala Met 
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(A; LENGTH: 92 amino acids 

(3) TYPE : ammo acid 

(CJ STRANDEDNESS ; single 

(D) TOPOLOGY: linear 

Ui> MOLECULE TYPE : protein 

1X1) SEQUENCE DESCRIPTION: SEQ ID NO:228- 
Pro Ala Tyr Pro Ala Gly Thr ^ ^ ^ ^ g ^ ^ ^ 

-P Gly Gly lie Val Ala Leu Pro Gin tl ^ ^ Giu g1q £ ^ 
Ala Leu Olu Ly 3 Ala Ala „ a £ a ^ g ^ g ^ ^ £ ^ ^ 

-vs Asp Arg Leu Lye Ara Glv Gl- t ^ 4 f 

- - - ' Tl / - nr Asn Leu T br- 3 X n r .._ 

55 * '"^ ^/^ 



A„ Ala 31u ser Asp C Iu ; jeu Jly Lyo MftC fj 



Glu Ala Leu Pro : ~i 

^= -v. va, uiy u y S Val Gin Ala 
85 

90 

(2) INFORMATION FOR SEQ 10 NO: 22 9: 



Val Ser Ala Leu 



fi) SEQUENCE CHARACTER I 3 Ti CS: 
iA) LENGTH: 72 ammo acids 
•:3) TYPE : amine acid 
-I STRANEEDNESS : Ginqle 
(L) TOPOLOGY ■ linear 

MOLECULE TYPE: procem 

INSCRIPTION: GEO ID NO: 109 

-a^. Hu Lou A3D 



• r al 



/a ^ 



Asp -vg Val 



21e Pro ^ - - , ^ ^ 

:q " ^ a Va - Thr Lys He He 

-S P - . .. . .... _ , 

- - ■ J - : ' - - ~eu ber Vi^ ; 1 - 



INFORMATION F~" • — 
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(Xi) SEQOENCS DESCP.IPTIOf.' : 3E Q ID WO :230: 

*!. -a as p ne o: y ^ Ala Pro Ma pro Lys ^ ^ ^ ^ ^ 

- -» «- -ro Ma Pro Thr Pro ; £ Mu p „ ^ ^ ^ 

- Ala ^ Gln pr0 Ma flly ^ ^ G ^ g ^ ^ p ^ „ v ^ ^ 

Pro Leu Va^ Ara t ^ T « » i 45 

50 ' " ^ £ a Ser *» Asn He Asp Leu Ala GIy 

Val Thr Gl;/ Thr civ Va^ r^- r> , 60 

65 ' ^ ^ ^ ^ Arg Lys Gln ^p Val Leu 

Ala Ala Ala Glu G i n Lvs , 75 80 

85 ' ' 9 ^ 3 LV " 3 ^ a P « «a Pro Ala Ala 

Gln Ala Ala Ala Ala Pro Ala p~, t f? 95 

100 ^ ^ :H p ~ Pro Glu Asp Pro „ ec 

Pro 110 



{2) INFORMATION FOR 3£ 2 ZD HQ -.221- 

(l) SEQUENCE CHAKAC^-R-5-r^5 
(A) LENGTH: 113 ^ ^ 
<B) TYPE: ammo acid 
fCJ 5TRANDEDNESS : singl* 
(D) TOPOLOGY: Linear*^"" 

MOLECULE ^P". ■ 
protein 

->:i] SEQUENCE LESCP. rPTION ■ se^ 
'.'al Leu Val Jer He 3 € 

• -v 3iu Ala \--r - • .. . ~° IS 

:; , a - ,u a Ala ? r= 

Ala Pro Lvr - , n>-^ - ~~ -iO 

3 5 ' ^ ~ ' * A ^ Pro Thr Pro 

lys .Ala olu ?ro .-Ua r 4S 

50 '" r - ?ro Ala Ala ?rc Aia 

^ '-^ 1 : ' '/* A::; :,,r Leu Ala J, r 

''/ ui'/ J:-,- 

100 '^^ r.; a Ala !-V S ^yo Ara 

A. a :.ys Ala Pro Ala ^'^ 110 



3ei ' AU ^ P G1U As P A1 - Thr Val Pro Vai Gi, 



WO W -12076 



PC T'I SW 0J26S 



(ii) MOLECULE TYPE : protein 

(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 232: 
Pro Lys val „ n Ilfl Gln Qla ue ^ ^ ^ 

Hi- «ln Ma Le U Thr 81n cln Ala AU - n ne ^ « ^ 

cm uu giu Met ^ Leu ^ ^ g H Leu Aia ^ ^ ^ 

Gin Val ^ V M ^ aln ^ ^ ^ ^ ^ ^ £ ^ ^ ^ 

Ala Gly Asp Ala Ala Lvs Ala — „ ^ * 6 ° 

o5 ?0 ^ a — ' jaU ^ Asn Asa Ala Ala Giu Ala 

?he Ala Ala Gin Leu 7,1 tu^ ^ 75 30 

^eu /a. Thr ^a Giu Gin Ser Val G 1 m a*-, r_ T ._ 

Thr Leu Hi Asn — , - " 95 

^ -a .eu ,e: Ma Ma Ma Cir. Ala Ly S Lvs Al a 

S ^ ^ £ ^ - ^ U. Ala £ Arg Thr 

- , U Ser 81a Leu Clu Cln Aid L ys M eC Gin Cln Val s.r 

£ " U " r ~ ^ ^ ^ Ma £ Gly ^ Thr Pro 

^ " ? G ^ '£ ^ *<> ^ ^ S Ar 3 T>, Ma ta £ 

— : ^er Giu Leu Ala Giu Se- 

ibc 13 : 

. oECUK:7CL G-iARACTKRIGTIGS : 
■Aj LENGTH: ™i ammo acids 
^ rvp E: 3mir , 0 dC . d 

•C) STRANDEDNESS : ::—gie 




W O 99/42076 
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in 



SEQUENCE CHARACTERISTICS : 
(Ai LENGTH: 182 ammo acids 
(B) TYPE: amino acid 
fCJ STRAND EDNESS : single 
CD) TOPOLOGY: linear 

UD MOLECULE TYPE : protein 
(XI) SEQUENCE DESCRIPTION: SEQ ID NO:234: 
Glu Val His Leu Pro Val Asp Val Giy Glu Pro ^ g1r prQ ^ ^ 
Ala val Ala Asp Gla ^ His ^ g Ue £ ^ ^ ^ ^ £ ^ 
His Thr Pro Pro Arg Val cys Gin ^ Trp Hi3 ^ g Gln £ ^ ^ 
^ ^ A ^ ^ ^ ^ ^ ^ Asp Ala " g : . eu cy S Ala 



^ii A^a Cvs A^n Va 

v Pro Arg ?he Giy Phe 1 

85 



50 



-eu Va. ^ , ly Vj: 3., ^ Arg ? ^ ^ 



Arg His Glv Pro Glv P*- 3 i— =l , . ~, 80 
<- = A., „ ly ?he prc ^„ Val a ^ 



~ys Giy :ie Arg Gin Aia „, . , 95 

10 0 3 " 3i GIU ?he Giy Val 



Leu Pro Glu Arg Ala Hla G iv Val G' Ar- w a 

. ,- J — Ar_ Acn Aon Arcr Val Ala 

Asp Arg Met: ?r j - ■ - 

- - - -eu Giy Arg 



'hr Asp Arg Leu Th: " 

130 135 
V- Pro Arg Ser VhI ' 



150 . -~ " tXG ^ G ^'* P * w 

^-.n Ara I> ^*-o :,■ } 1G 0 

, _ _ a - • .:s ^er 31n o-^ 

. ^ - " - * ^ 

Arg Ser Leu His V ai " * 175 

130 

2 format: for sec :c :;c . : - : 

i; SEQUENCE CHARACTER I STICS ■ 
-~NGTK . 16: i^mo ic:"- 

3 TRAi^EDNES S .v^i. 



-■*-- ^ql-ncc dec~r:p~:.->j 
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Gly 

65 

Lys 

Pro 

Lys 

Gly 

Gly 
145 



Lys Met Lvg Va 1 n, , 

>- Va. Axa , eu Leu Glu Ala Leu Pro Lv= Val Gly 

Val Lys Ala Gin Glu Hp Met Thr ri. ^ ^ 80 
85 Thr Glu Leu Glu ll e Ai a ?ro H;s 

Al. Ma pj. Val Ma ser yal ^ £ au ^ £ ^ 

Se, ser Ala Pro P ro ta Pr „ £ Gly ^ ^ ^ «J ^ ^ 
jeu Trp Trp Wa ^ p „ ^ Ue ^ ^ Ser , ly ^ Thr 

Pro Ala His Asn Ser Gly Ara Th. d * l4 ° 

1S 0 ' ^ ThT ?r ° ^f? T ^P Gly Gly Thr Arg 

160 



150 155 
:2) INFORMATION FOR SEQ ID NO: 236: 



fl) SEQUENCE CHARACTERISTICS ■ 
■.-0 LENGTH: I7g amino acids 
:3) TYPE: ammo acid 
•CI STRANDEONESS: single 
;Dj TOPOLOGY: linear 

^LHCULH TYPE: protein 

(Xl) ^QUENCH DESCRIPTION: SEQ in NO:236: 
Irp His Arg Gl; 



As z 



Ars 31 >' ^ Ala Asp Gin His Leu 
... 10 15 

* Ala Ala Ala Cys Asn Val Leu Leu Val 

' 3S ^ ? " ; " ^ H ^ -V - D « 31y Pro Arc Phe 

- -o .,ra /al * 



-eu Asp Ala P, 
2 3 



Va] 



35 



- Ala Cvg Gly He Arg Jin Ala 

-'al Val Pro Jlu Arg Ala His Olv y al 

^ _ ^ 5 30 

Ihr Aop Arg Leu "Thr Asd Arc; 
90 ' 
V Arg Giu Pre Arg 3e, val Jl v ~lv g^ 

: 'J5 



- D ^ 

'^1 H i :-, Ala Jlv p r ^ 



* ** u 

-i-J Arc: A^a 
15 5 



r. A* a A^a 
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!19 



<lu Ser Val Thr Glu Gly 
" eu lys Lys GI '/ ^er Val Gin Val Asp 

^ Glu Val Ser T ; r -V* Val Asp Thr Glu He Pro 



•'al Ser - ■ 



50 ■- ^ r Asp Glu ,\sp Ala 

~~ 6C 

— val jiv jiv :; u ^ v > v~ - ^ ^ 

j-/ .^a .^a Pre Am Pre Lvs ? r: 



60 
120 
180 



(C) STRANDED NESS : single 
£D) TOPOLOGY; linear 

da) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO:237: 

G™^S cllrZrr^ CCCGAGCTCG ^GAATCGGT GACCGAGGGG ACCGTCATTC 
GTTGO.TGAA GAAGATCGGG GATTCGGTT'C AGGTTGACGA GCCACTCGTG GAGGTGtS 

SSSSS SSS? ATCCCGTCCC CGGTOGCTGG ssss 

ssss ssss? sssss r™™* cm3aTCG01 »• 

271 

(2) INFORMATION FOR SEQ ID NO. 23 8: 

(l) SEQUENCE CHARACTERISTICS: 
LENGTH: h 9 ammo acids 
(3) TYPE: ammo acid 
(CJ STRANDEDNESS : single 
(5) TOPOLOGY: linear 

dif MOLECULE TYPE: protein 

(XI) SEQUENCE DESCRIPTION: SEQ :d NO: 238: 
Am Lys Pro Val Leu Met Pro V 'a-- - 
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SEQUENCE CHARACTER t ST t C o . 
(A) LENGTH; 33 9 base Da Irs " 
fB) TYPE: nucleic acxd 
fC) STRAND ED NESS : single 
<3> TOPOLOGY ; linear 

ilX) M0 ^CULE TYPE: Genomxc DNA 
fxi) SEQUENCE DESCRIPTION: SEQ » NO:140: 
ATGAAGTTGA AGTTTGCTCG r^rvrn^ ^ 

-cctgcct coonoc^ ££££ GTTGTCCAGC ^gcttgtg 

GGCTATTOCC CGGGTGGCCG ATGGG^S SSSS ^^GAAA 

^accccgacg gctcgttttg gcaccagtgg ccgtctc ^ga CGGCGAGAAG 

TACTTCGATT GTGTCAGCGG CGg£££S £1™™ ««CAGTTT 

-TGGGGCAA TTCCGTCCGA CCAGCCCAAC SccS^ CGCCGCCACC ^GTGGTTGC 

>2) : ^°^-CN FOR SEQ ID >; 0;241: 

:i) SEQUENCE CHARACTERISTICS ■ 

( A ) LENGTH • in ,„ 

^i^.n. amino acids 

TYPE ; amino acid 

fCJ STRAND EDNESS : single 

(O) TOPOLOGY: linear 

1**1 MOLECULE ~vt^. 

* * prccem 

:< - ; SEQUENCE DESCRIPTION- 3» 



-v ID NC : J 4 I . 

^eu ^vc 

«eu 3er Thr- - i =, t - . - 

"*/s Ala 



^9 Leu Ser Thi Ala He Leu Glv 



Ala Ala leu Val ">hc *- a ■ , ~ U -5 

10 ^ J " A ^ a 3er Ala Asp ?ro Asn 



1/ Asc :eu A' a ■• ' - 4b 

5u ^ ?TO ASD 



ier Phe Tro s - , ~~ 50 
n =; " ' '^ e ~ T *hr Trp n 

nD - l -ly Pro Gin Phe 

H • ■ ,: : . ^ 



^FORMATION .-OR 



SEQUENCE I KARA C T E E I s ' 

' n ■ TYPE n" - ' ^ ■ a 

straneeeness 
topology 



60 
120 
180 
240 
300 
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GTGACCACGG TGGGCGTGC" ACCAArr^-~- - 

CCGGCGGCAA CGGTGGCGCC GGGGGT^ r^^ 5 CCGCGGCG - GCCGGCGGCT 
GTGGCAATGG CGGTGATGGG S^S^T ^GCCAAG GGCGGCGCCG 

CGGGCGCCCC CSGCGGCAAC GcSSS CCCCSC ^ ATCGGGGTCA 

CAGGTGGCGA CGGCGGCAAA CKCGGCAA^ rrZ ^™ TGGCA GCAAC CCCAACGGCT 
GCGCCAACAG CGOCATCGTC ^SSJS GGCTCGATCG 
GAAACGGCAG C G-C^TTCC- GTGGGGCCGG TGGCGCTGGC GGCGCCGGCG 



(2) INFORMATION FOR SEQ ID NO: 243 : 

(1) SEQUENCE CHARACTERIST-CS- 
(A) LENGTH: 424 base Dai - S 
<B) TYPE: nucleic acxd 

(C) STHANEEDNESS : smgl~ 

(D) TOPOLOGY : linear 

(XI) SEQUENCE SESCS-pttov, Tr> 

I - f - u ID NO: 243; 



GTCCGGGTCC CACCAC3"" C - G ~~,., 

TTGACTCGTT CAAGaIaX^ GCC^TTOT^ J™^" GG3 - CACCA GCCCCrrTTC 
ATGGGCAACA TCGACGTCGA CATC~CGC~ T™^" TGTTGGC:A ~G ATCGTGACCC 
GGTGGT3AGC ATCGGTCTAG ^^f™ 0 TCCA -GCGAA TCGCCGCCGC 

CCCGTCGGAG CCCGGGGTTG TGT"~"AC ---^^ GTAC "^TG CCGACGCACA 
CGTC3GCGCG CCAATGGG 2™" SGG"~r^- — CTCGGA AAGGGGTCGG TCGGCAACAT 

-GAA—Ai ^rt^CSu. GTTCACGAAG CrG"~— jr- - - , 

^"a-.JC.o ^-GTGCAA"-\ A"T ■ „ — ^-^_A.Gv> -GTTTTGGGT 

C -"C ^* UJUuCTG v.Cv.ortGGTGT ACGACGATCC 



=N ~0R 3EC :^ NO: 244 



1; 3 -0'JE;.— - ~'HARA ~ TZR T ~ — » 

c: oT?_ai»tjei;ness ainqi- 



60 
120 
180 
240 
300 
360 
37i 



60 
120 
180 
240 
30C 
J60 
420 
424 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(XI) SEQUENCE DESCRIPTION: SEC ID NO: 245: 

TGGCSTATGC GCTTCGCAGC CGGTGCCGCG TCAACGCGCC GGAGGCAATC GCTTCGCTGC 
S^ TCG ATCGCAOWST GCCGTCGTGC ACCGACACCG cS S 

GCGGAAAATC GGCCGAAATC T0QCCCTCA6 TTCACGCTCG GCGcS^CG 
GTTCTGGAAG TTGGGTGCGC GCTTCTCGGC GAACGCGCGC GGGCCTTCC- TCgStcSS 
GGACAGGAAG ACCTTGATGC CGATCTGGGT GTCGATCTTG AACGCCTCGT rrSS 

^ScG^G SSS? ' CCGCAAGAT 

■jATGGCGTCG GCAAGTTCTA GAACCTTGGT CAACGCCTGG CCGTCGGGCA CACGTGGCCG 42 0 



■ ^ C3CACACTGG 7C3ACAGCC3 GTACTCGGrA "AAATGCTGC 



information ?cp. zz~ 



sequence c-iApjiuter;-; 



lengt: 



case cairo 



' TEA NE EE ME 3 



.-l) MOLECULE TYPE: cDNA 
SEQUENCE E EST"? * - 



.■ALrrt^ ^.j'j^ Jrt ju'j.j. j^.r !C , "GC'*C'~"" " " — — ~~ - ^ „ . 

. " ~ ^ J ^ - - • GC3C T3GGTCGACG 

-^^.^ * AGT-.iAwG™ irAA'.".;-;r,V; — 

- — r^vrtOiAj^rtL GLhmGmmCjGTT ~~~ATGV" ' — _ . 



60 
120 
18C 

240 
300 
360 



42: 



(2) INFORMATION FOR SEQ ID NO: 246: 

SEQUENCE CHARACTER IS TICS : 
'A, LENGTH : 425 oase cairs 
(3) TYPE: nucleic acid 
CO STRANDEDNESS : single 
(D) TOPOLOGY: linear 

■11/ MOLECULE TYPE: cDNA 

a:) SEQUENCE DESCRIPTION: SEC ZD NO: 246: 

^:;r;;; -^g™ eagatc-ccc ggautcggta GTGCCGCCGG 6 
^™^ A :^; ggcgcgg cgaceataag gtcgctaatg CCCAGGTAGC 

r^r GTCG --^atgc gactct^g ctcgccgacc ^agc™* . 8 

^gtggccgt gS-— *caagtcgat cgaatgcata gtggcctcca :l 

- " " j^oi ju.v.^.H CGGC AAATGE "TTGATTTCT AGC^^^^^^^' ^ n 

71^Zll - ATC --CTGE GGGATGAATG GGAACCGCAG GATGGCGACA A AC 



jo o 

'CG ; 2 3 
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12) INFORMATION FOR SEQ ID NO : 24 8: 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 123 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(iii MOLECULE TYPE : prouein 

fxi) SEQUENCE DESCRIPTION : SEQ ID NO:2 49 : 
Asp His Gly Gly P,o Ala Thr ^ ?ro Qly ^ ^ ^ ^ 

I? ^ <*Y Ma Gly S ly Asn .x, Gly 



Ser Gly r?iv Lvc sly Gl - a 25 , 30 

3 q' * wi - r ' Asn 31 / ^-'/ Asd Glv Ser oh« 

•10 

— y A^a Thr Ser Glv — Va - - ** 

,0 ^ :t * er ^ C v ^ Gly ; ua Pre Glv 

5 'D 



h 5 



ly Asn Gly Gly Lys Gly Gly Ala Gly Gly 



60 



70 ?5 



Asr: Pro Asn Glv Ser 



Gly Gly Asp Gly Gly Lvs Glv ^ v As- — - v ^ ^ , 80 
□ r * - °-y A^a G.y Glv Asn Glv 

-ly He Glv ^ = *°_ 95 



le Gly Ala Asn S~r 7,; 

100 Ser Gl y Ai a 



-y Gly Ala Glv Glv Ala ■ - ^ , 110 

115 , " ' 



informat: 



SEQUENCE ritARACTERi: 
\) LENGTH: I0-; amine 



3rrur *° " iCla 



MOLECULE TYPE: 
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(2) INFORMATION FOR SEQ ID NO : 2 5 0 : 

(i) SEQUENCE CHARACTERISTICS : 
(Aj LENGTH: 41 amino acids 

(B) TYPE: ammo acid 

(C) STRANDEDNESS ; single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE : protein 

[XI) SEQUENCE DESCRIPTION: SEQ ID NO: 250: 
Ala Tyr Ala Leu Arg Ser Arg Cys Arg Va 1 Asn Ala Pro Glu Ala lie 
Ala Ser Leu p„ Arg ^ Giy Ser Ufi ^ ^ ^ ^ ^ ^ 



la Pro 



35 



Pru ?rc Ser .^n Val Asn 



40 



(2) INFORMATION FOR SEQ ID NO .251- 

(i) SEQUENCE CHARACTERISTICS ■ 
(A) LENGTH: 2 5 ammo acids' 
(31 TYPE: amino acid 
(CJ STRANDEDNESS: single 
!D) TOPOLOGY: linear 

(n: MOLECULE TYPE: pror.em 

lX:; 5E cra::cE description: sec :u no : 2 si: 

Val Pro .eu .\cn Thr S->- ^-^ ;w- r . 

_o Ar ^ ^'j ,vsp Leu Pre 

-'al ^ro Pro '/a. M a ^c: Leu Lt-u Ser 

2 5 

— ^formation for sec id no -5- 
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Gin Thr Glu Trp Gly Gin Pro .^n Leu Pro Vai 7a 1 d^ q 
50 ^ 60 

(2) INFORMATION FOR SEQ ID NO:253; 

(i) SEQUENCE CHARACTERISTICS: 
(AJ LENGTH: 213 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE : cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:253: 

GCTTGGAGCC CTGGAGCGAC GGTGTGGC,- ^GGGGTCGA TTCGTTCTCC GCGAAAG™ 

ACT A AAG AC C ACGTTGACAC CCAACCGGCG GCCCGGCATG GGCCGTCGCG G CGTAGAAG^ 
GACCGCG GCGCGAAACG TTCGCTGCTG CGGCCCATGC AGATCGCACA CGC-GC— G 
AACATCGGGT GGAGCCGGTG GTAACGCCAG 3CT " " 

(2) INFORMATION FOR SEQ ID NO; 2 54: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 36 7 base oairs 
(3) TYPE: nucleic acid 

IC) STRANDEDNESS : single 

ID) TOPOLOGY- linear 

MOLECULE TYPE: cDNA 
lxi ' SEQUENCE ^ESCRIPTirr;: SEQ ID :JO:254: 

3rTC3GCGC:: -™CTGCGG jSGGCGCGGG TG GGGZGGGC ACCGACGGCG 
— — TCC -ACC3GC3GG ACCGGCGGAC ACGGCGGAGT JGGCGGCGAC GGCGGATGG 
TGGCACCCGG CGGGGCCGGC 3GGGCG3GC3 1G CAAGG C GG GG CAGGTGGT GC~-£c~ 

'':™ c :: iTirliz 2 A f C3GCGGG A — ~— c ^s^cgcc ggt^cgccg 

r?^l G ^' 3 — sggc™~ oCggacaggg cggcgtcggc ggcgccggcg 

^i_.-yAG<^CGG CACCGGCGGvG - - , 

^ — — v^.TT': ?GGGGGGTGT CAG TGGCACT 



IN " G RMAT I 3 N FOR 



• SEQUENCE CMARAG 
■.A, LENGTH: 42 0 case pair-, 
■.H: TYPE : nucleic acid 

C ' S TRAND EDNE ZZ 3 1 ;;g 1 - 
;L) TOPOLOGY: linear 



60 
120 
1BG 
213 
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226 



240 
300 
360 
420 



:;:::5I53: ■ GGGAC -- ccgacgccgc attgtccaac cgcctcgcct accccgaag- 
;:;^ CCGCA cgggccgcaa tcacgaccta accgaatccg AccrcGcS 

CGCCGAGC3T GACT5GGAGG ACTTCTGGGC CGCACCCGCC CAGTCGAACT CaScG^ 

.TTGAACAGC acgccgggca cctcgcccga acacatgcct tacgcggagc 

(2) INTORMATIQN FOR SEQ ID MO: 256: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 299 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(D> TOPOLOGY: linear 

(ll) MOLECULE TYPE: cDNA 

(Xi : SEQUENCE DESCRIPTION: lSZQ ID MO: 256: 

JC™~ — ^I^r' GC3 ' 3AA — SGCCG CAACGCCGGT ATGCTCGCCG 

"-GC-GGC-C ^^f 0 ^ 7 T "'~™— G ^CTGCCGGT GGGGCTGGCG 

™ =i «™ SSS SSS ,. 0 

GAACGCTCGG CACCGGCGGC GCCGGCGGG 
'2; INFORMATION FOR SEQ ID MO : 2 5 7 : 

.: SEQUENCE CHARACTERISTICS : 
-A; LENGTH . 13 amine acids 

TYPE: ammo acid 
IC) STRAND EONESS : single 
D- TOPOLOGY: linear 

■ ll '-GLECULE TYPE: prrj-ein 

■ :: secuence eescripticn : cec id no : 257- 

" = ?r? ^ 31 V 7 ^ — ' V,; A.; ier ? he 3er 

- ^ i c; 

.^a Lys Val Asn 
20 



1 2 0 
180 

2 40 

2 99 



J-CUENEE "HAr^.'TER :g '2; 

^ ~engte 121 -irr,:;— i—-- 

2 TYPE. at; id 

STRANDECNESS ■ r> ; ncr : - 
:r TCFCLCGV linear 

- "CLECTYLE TYPE ;:r :r_-:r 



WO 99/42076 P( Tl SW0326S 



20 25 30 

Val Gly Gly Asp Gly Gly Trp Leu Ala Pro Gly Gly Ala Gly Gly Ala 

35 40 45 

Gly Gly Gin Gly Gly Ala Gly Gly Ala Arg Ser Asp Gly Gly Ala Leu 

50 55 60 

Gly Gly Thr Gly Gly Thr Gly Gly Thr Gly Gly Ala Gly Gly Ala Gly 
65 70 75 80 

Gly Arg Gly Thr Leu Leu Leu Gly Ala Gly GJ.y Gin Gly Gly Leu Gly 

85 go 95 

Gly Ala Gly Gly Gin Gly Gly rhr Gly Gly Gly Arg Arg Arg Trp Arg 

100 105 no 

Ser Gly Gly Cys Gin Trp His Trp Trp 
115 120 

(2) INFORMATION FOR SEQ ID NO: 2 59: 

SEQUENCE CHARACTER I STI OS : 
:A) LENGTH: 34 amine acids 
■3) TYPE: amino acid 
,0) STRAND ED NESS : j ingle 
(D) TOPOLOGY: linear 

111) MOLECULE TYPE: protein 

(xi J SEQUENCE DESCRIPTION. 5EQ ID NO: 259: 

Gly Val He Gly Lys Ala Thr Ala Gin Arg Pro Val Ala Ala Gly Arg 

15 10 15 

Pro Arg Pro Arg Pro Gin Arg Vro Val Ser Aor Arg Val Ser Asc Gin 
~C 25 ' 30 



Z, INFORMATION FOR SE0 10 NO : 260 

- S ECUENCE CKARA0TER1S7I0S . 

T\) LENGTH: 9 9 ammo acids 
,31 TYPE: ammo acid 

O ' STRAND EDNES S : s ing 1 e 

O 1 ~OPOLCGY ■ 1 : -ear 

MOLECULE ,TT YP" rr->tem 



Met Leu Ala Gly Ala Ala 01' 



Slv 01 v Asn Ala Gl v 



Sly ?he ser Phe 
30 

V.v 0]v Leu Phe 
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Gly Gly Ala Gly Gly ? he Giy Asp His Gly Thr Leu Gly Thr Glv Gly 

Ala Gly Gly 95 



(2) INFORMATION FOR SEQ ID NO: 261: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 282 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: cDNA 

(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 261: 

TCCTGTTCGG CGCCGGCGGG GTGGGCGGTG TTGGCGGTGA -nr— r^r* -~~r*~ 
CGGCGCGCGG ^GGGC— "^'^ ^-GGTG i GGCn TTCCTGGGCA 

GCGGCGGCGG CG^G^ GGGCCGGTGG GCTGTTCAGC GTCGGTGGGG 

_ r ™~ ^-ATTGCTCG GGAACAGCGG TGCCGGGGGG ^CCGGCGGGT 

;::. c ;5Ir^ ogcg^g gcc**^ tgggctcS 

~^.^, UJ a^.GGGe GGCAACGCCA GCCTGCTGGT AA 

(2) INFORMATION FOR SEQ ID NO: 262: 

■ i ) SEQUENCE lrL\RACTERI5TICS : 
(A) LZKGTZ; 415 base oairs 
(3) TYPE: nucleic acid 
fO STRAND EDNESS : single 
(D) TOPOLOGY: linear 




o J 
120 
ISO 

232 



aatgccacac 'IS"? ^^c~ ScSSg ^ 

jAGCCGGACA ACGC3AC* ~ '^r^" CCGAC CGCTGCCACC 13C 

■■"TGCGC^" - — -™.~ . ™ ^JlT'^ ^- rt ^'" GACTGAAGAA TCCAAGTTCT : : 

-^ACA^CC GGAAGAAGGT GG^ACGA^ ^ll:^ ^ C3GGAA '- "AACCGGCGG 2^ 

ATJGTGG-AG -DA" - - ^1™ -C03GTGAG CGCCAACGCA ^ 

: inf:?maticn f~p. ^ ; _ s3 . 

:i l SEQUENCE C3£ARACTERISTIC5 : 
-A; LENGTH : 3^3 base pairs 
3' TYPE: nucleic acid 
:C 5 TRANCED ME S3 j i ng I 
T2F2L0GV linear 



WO 99/42076 



229 
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(2 J INFORMATION FOR SEQ ID NO; 264: 

<i) SEQUENCE CHARACTERISTICS - 
(A) LENGTH : 423 base pairs' 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



— ■ — fcjiitiT, 

3EQUENCE DESCRIPTION: 



3EQ ID ;jr 



C^CC C S SSS CACWTOCGC oncac™ 

GGCC3AG7AC CCGCGGGCCG AAC^CAGC- Ir^lT^ CGCGCGTTCG GTGTCGTCCG 
CCACTCGTGC CGAATTC3AG CTC™^ C3CG " TC ^ ACGGGGATC3 

;s ~- -™ s^c? ss 

AGAGGCCCGC ACCCATCGCC ~- ^ rt 1 «AC"TGGC CTACTAGCGA 



AATGGACOC 



; 3EQUENCI HiARACTERI.~TICS 
A ' LENGTH . 404 oasn 

3 tralt ednes s . . ; ^ ;)y > 



50 
12C 



AGTCCGACCA ££3™ CG7AT ™ 1CAGCGAGCA GATCAAGTCC 

TATTGAGrrc TGGCGTTGGA r^T™^ TTTCGCCTGG GACGAGACGG 

CGGTTGTCAA TAC^SS TGGGGGATAT « 

CGGATCCCTT TTCGAACGTT GC3CAAGCGC ^C^S =4 0 

GGCGTAGCTG CGCGGCCTCG ATCGGTTTGA £^£3* ACTGC -GCT 3 0 0 

ACCTGACGCT CCT AC.TCATCGC AATTCCCGCA ATGGGTGAGT 36o 



373 



6C 
12 0 
180 
24 0 
300 
"360 
42 0 
4 2 3 
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.. . - , , ____ " -^ 3 ^^"-3G GCACGAAACC 



2 information -or ^ ::c-.2oS. 



i , 3 EQ UENC E CHAR A OTER I s t : c s 



60 
120 
180 
240 

300 
360 
420 

421 



(D) TOPOLOGY: linear 
(ll) MOLECULE TYPE: cDNA 

(Xl) SEQUENCE DESCRIPTION: SEQ ID NO:266: 

GTCCTGGTCG CAGGCTGTTC TTCGAACCCG CTGG CT AA CT TCGCACCCGG GTATCCGCCC 
ACCATCGAAC CCGCCCAACC GGCGCTGTCA CCGCCTACTT CGCAAGACCC GgJSSS 

G"""™ SSS? CCCCCGGGCG gcact ™ acaacggcac ccqSSSJ 

GT^UYCTGC CCZCGGGCGC CGATTCGGCG GCACCCGCCA GCATCATGGT CTTCGATGAC 
ATGCACGTTG CACCGCGCGT CATTTTTCTG CCGGGCCCGG CAGCCGCGTT GACCAGCGAC 
GACCACGGCA CGGCCTTCCT TGCCGCCCGC GGCGGCTACT TCGTGGCCGA T^CC^c 
GGTCACACCG CACGAGTGAA TGTCGCTGAC GCAGCGCACA CCGATTTCAC CGCGATCGCC 

12) INFORMATION FOR SEQ ID NO : 2 b 7 : 

;i; SEQUENCE CHARACTERISTICS- 
^) -ENGTK : 4 2 6 oase -jairs 
(3) TYPE: nucleic acid 
CC) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

MOLECULE TYPE: cDNA 

■:xi: SEQUENCE DESCRIPTION: SEQ ID NO: 267: 
ATGCATATCA CG CTCAACGC CA^C^CC*'" - — ^^r,^^ 

GGGCC^CGG CAGTGAACTA 

-,_-, r _ 1 " jJ - GiOU -^CG^.GG GCTCGCGCOT GGCGGCGCTA 

™„ _____ ^ AU ^ JiG ^ GCGGCTGGC 2GAGTGGCGG i 8 0 

____ 7 "^^^^ - ^'^^^^ AGCGGGCCGA 7CCGAACTTC :-;q 



50 
120 



^T^^ ^u rt C^ _ . . jGCGTTG/vTG ""GCGCAGCA ~— ir"s^- -™. T „^_ 

, w _ ^l,^ . --^TTCCATC J 0 c 

- rtU^rt^w -n, CGACGAACTG "~ 




WO 99/42076 



PC T/l S99/03268 



CGCCGCTCCG ACGGCAAGC** — , 

(2) INFORMATION FOR SEQ ID NO: 269 



SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 73 9 base pairs 

iB) TYPE: nucleic acid 
(C) STRANDEDNESS : single 
CO) TOPOLOGY- linear 

11) MOLECULE TYPE: cDNA 

SEQ ID NO: 2 59: 



SEQC^NCE DESCRIPTION 



AGCG 



INFORMATION ? C R SEC 

• SEQUENCE CHARACTER 1ST'"'- 
•A) LZXGTX: 6 9 base — ^ 
■2; TYPE : nuci, :: icl . 
C 1 GTRANDEONESG . si----,* 
D - APOLOGY. .: nea r' 

- MCLECTJLE TYPE . ;ONA 

CE0:JKNCr- ; ; .... 

jAGG/"1;" — - 



NO : 



48 

52 



SIS" SSIf! =;™===a — o^e oao*^ 

~~ iig = SSS s SS j~ 

^T^r g^caaaccg atcTc^tcga tS?^ — c~g^? GGC7cgt - t 

CC^CATC^ KCcSS SJ™^ AGGATCCGGT £ A TCc££ ACCESS 

aagccgtcgt cgcctttccg f GGCGA — ctacgaacat tgggtgaacc 

ACAAATGGAA GAACTGCGCA GGC^eGC ^GTGCAG ACTTCGGCCG 

ggtggacgtt tgccgacgtc aaaggcagc gaataaggcc aagacctacc 

AAGGCGCTGA GGGCTGGGAA ^ ^GGTGATA GACACCCAAG 

ACGTCAACGC ATGCGGGTAC CAG^C^ ^^f:! CTGGTTGTCG 

jTtgacaaag tcaacaagg ~" " ' ~ 



:bc' 

2-5 0 
300 
360 
420 
480 
540 
600 
660 
720 
739 



: INFORMATION E2P. - 

-^CYENCE CKAKACTr-^—" 
A, LENGTH: baG"^^--"" 
3 TYPE - , 
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c~^" S^^xf: £^ «^=c= ^ C = K 

.a**™ o.^::;; ™ G ™S£ 

SSSSt TCflcccCG " ™= 
~ ™ = ™ SSS SSS 
3255 2£S ™ ~ = = 



60 
120 
180 
240 
300 
360 
420 
480 
523 



(2) INFORMATION FOR SEQ ID NG:272: 

(i) SEQUENCE CHARACTERISTICS- 
(A) LENGTH : 224 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDNESS : smqle 
:D) TOPOLOGY : linear 

MOLECULE TYPE: cCNA 

IXD SE2CENCE DESCRIPTION: SE2 ID NO : 272- 

o.CGGTGT CGTCGGGGTA 3GAGCGACT'" ~~~r- r ~r^ r ~ 

2CAGGAAC CGGT3.~C3G~ ■ ~~r~ ^ ^-^CGCC GGAGCGGGCT 60 

1CCTTGGC ~ GCG — ~^ -Z^^ — -ACCAA AGGCCGGATC GATTCGGCCA 12n 
:GCT:GAAG GGGCCC3GCG T^GG~ ^^Sl GCCAT ACCACAAACC : 8C 

(2; INFORMATION FOR .5 Eg 10 NO: 273: 
: S H Q OENCE CIIARACTER I : > T I 0 3 ■ 
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;c; STRAND ED NESS : single 
<3> TOPOLOGY: linear 

(11; molecule type : com 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 74; 

SSS SSS £S££ SS 0 "" " M « 0 « 

GCGAACCAGC CCGTCGCcS TckScS ^ ° GAACC ^G CC CAGGGG CA 

«CGAAG« T TTGACGCGTC SSSS SSS? 

CCGAGCTGAG AGCGTAGCGC CTCGAGCTCA cSSS^ CTGG ^CGCC 
TCGCTGGTCT TGTTCCCGCG CAGCAC-GC ^"CCAGCAC CCAGTCCACC 

GTGACCCKA ACGCCGGCAC ^3^°™ '^CGTCGAC ATCGGCGCGG 

TCGCCA ^ ijTCGCCC GCGGCAGGGG CGGCAACTGC 



•X ?CR SEQ ID NO: 275 



3ECJENCE CHARACTEP -^s—CS ■ 
;A) LENGTH : 219 oase oairs 
<3> TYPE: nucleic acid 
— < 3TRANDEDNESS : single 
^' TOPOLOGY : linear 



— --.N : oEO ID NC : 27 5 



^ MCLECYLE TYPE : cDN« 
SEQUENCE 

GCGGACACGG CCGACAAAGC G~^'" — — t^-^-,, 

AGGGC3GCGC CGGCCGCGGA G ^ CG '™GGCGCC GACGGGGG CC 

3GCGG7GG7A —ZCGGGCZZ ACCGC~j™ ^"^f ^^^^^ GGCGGGG CAA 

GCCGGTGGGC IGATCGGCAA "CG~GGG~''~ — -X^^T* ~ CAT::;GGC ^ CGCGGGTGGC 



■■-vJENCE niAPj^CTEP:. 
A; LENGTH: 5^1 case - 
■3> TYPE : nucleic L;:, 
7 tr a;,- ed:^: G 7 . , : » - 



i * Li 

airs 
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(2 J INFORMATION FOR SZQ ID NO:: 77: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 93 amino acids 

(B) TYPE : ammo acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

fii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 77: 

T Gl7 ^ ^ ^ V - ~>' OlY Val Gly Gly tep Gly V al A,a 

5 10 , 5 

? " S " GU Gly ^ Ala ?r ° G1 >' Gly Gly Ala Gly Glv Ala Glv 

^„ .... ~ 5 30 

-e. .... Iiy ^ Jlv A1 lv 

7 c , . u v ^ ; ^eu 

,xy Asn Ser Glv Ala Glv Glv se- o„ ^ 

• ' ^J-y Ser A^a Leu Lt*u 

5^ - - - — 



5 r 



60 



.3p Gly Gly Aid Gly Gly Aia Gly Gly Val Gi 



70 ' ' - — ~~ ^ T - G - 



T 5 



7 j "/ Ala Glv Gl'/ A' i 'V-' ^1-, i 

. ,^a u i y ^ S:1 Ala ber Leu Leu 

:::fcrmaticn fcr sec :g :;c: - 3; 

^E-UENCE CHARACTERISTICS : 
•■^ LENGTH; 25 amino acids 
3; TYPE; amino acid 
CI STRANDEDNESS: smale 
■r,i TOPOLOGY : linear 



80 



ION; 



: 2 
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Lys Pro Asp Axy Pro Ala Ala Thr Vai Gly Ser Cys Thr Thr Vai Arg 
1 5 10 15 

a Pro Cys Ser Gin Pro Vai Thr Thr Ala 
20 25 

(2) INFORMATION FOR SEQ ID NO: 28 0: 

(l) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 2 0 amino acids 
CB) TYPE ; ammo acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 280: 

Trp Pro Ala Gly Ar-r ?ro Met His Pro Ala Pro GZ 

His Pro Pro Asn 

2C 



mr Ser 



(2; INFORMATION FOR SE^ ID NO -28' • 

SZQUEXCz. CIHAPACTEPISTICG 
■. A ; LENGTH : \ { C am i nc aci ds 
(B; TYPE : ammo acid 
:c: STRANDEDNESS: s^g.e 
(D) TOPOLOGY: linear 



MOLECULE TYPE, orctem 




W O 99/42076 
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Me: 

i 

Leu 

A™ 



SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 142 ammo acids 
(3) TYPE : amino acid 

(C) STRANDEDNES5 . single 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: protein 

(XI) SEQUENCE DESCRIPTION: SEQ IP NC:232: 

His lie Thr Leu Asn Ala He Leu Arg Ala He Phe Gly 

5 10 
Ser Glu Leu Asp Glu Leu Arg Arg Leu ii„ u ro P . Q 

20 25 
Gly Ser Arg Leu Ala Ala Leu Pro Lys Pro Lvs 

40 

- - *j J — - u A_l a Glu T rr? Ara 



35 



He Asd 



Arg 
45 

Arg 



Trp 

30 

Asp 



Ala 
15 
Val 

Tyr 



Gin 



n 0 



Ala 



^ Leu He Glu Ala Glu Ar:; Ala 
70 

Asp Arg Thr A.sp Val Leu Ala Leu Mez Leu Arg 

8 5 q c 

Gly Ser He Me: Ser Arg Lvs Asd H- r 

ioo 105 

^eu Ala Ala Gly H :s Glu Thr Thr Ala Ala Thr 
~ 15 120 



Asd Glu 



Ser Thr Tvr 
Leu 



Asn 

130 



3er Thr 



Trp 
125 



yrr- Thr Cys ler Arg 
14 0 



Leu 
110 
Ala 

Trp 



G 1 v 



Gly 

Thr 

Gly 

Asp 

Phe 

30 

Asp 

Thr 

Arc? 



H INFORMATION FCR SEQ : D ? ; C . : a 3 

SEQUENCE CILARACTERIGT7CS 
~£.NCTH . 1 -j 3 amino , i c : d s 
TYPE : ammo acid 
S TRAND EDNE S G . Jinql e 
TOPOLOGY: linear 
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Tyr Phe Val Ala Asp Leu Ser 
115 

Ala Asp Ala Ala His Thr 
130 

31 y Lys Leu Val L> 



Ser Gly His Thr Ala Arq Val Asr; Val 
120 



Asp Phe Thr Ala Tie Ala Ara 



145 

Lys Asn Pre 



135 

eu Gly Ser Ala Asp Gl 
150 



^rg ier Asp 



140 



y Ala Val Tyr Thr Leu Ala 



155 



160 



(2) INFORMATION FOR SEQ ID NO:284: 

H) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 24 0 amino acids 

(B) TYPE: ammo acid 

(C) 3TRANDEHNESS : single 

ii) MOLECULE TYPE: protein 
;Xi: SE ^^-S INSCRIPTION: SEQ ID NO; 284 
y Ala Pro Pre Ser Gly Gly p ro S er P: 



Trp Axa Gin Thr Pro 

1 0 

Ihr As- Pre Trrj 



20 ' A " 3 Gly Ala Ala Va - Val 

Leu Val Leu Va. Leu 3 ' ■ ' si, — ^ ^ 3 ° 

J ^ 4 0 

? "" " a - Gi- GIu Ar^ Leu Ser Ala 

^ 60 

-eu -eu ^ei; Asn S**- ■ . , 

: = . " " ^ /al Me - G: V Ser 3e: 

Sir. ?r;: , M 5 ^ Bc 
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iA] i^ GTH: :2 amino acids 
fBJ . YPE: amino acid 

(C) STRANDEDNESS .- single 
fDJ TOPOLOGY: linear 

(ii) MOLECULE TYPE; protein 

(X1) SE '2™NCE DESCRIPTION: SEQ ID NO.-28S: 

Asp Val Val GIu Ala Ala li e Ala Am ai ^ 
i 5 ^ a ^ Ala Glu Ala Val Asn Pro Ala 

Leu Asn Ala Leu Ala Tyr 



10 



15 



20 



(2) INFORMATION FOR SEQ ID NO: 2 86: 

' 1 ; SEQUENCE CHARACTER I S TI OS : 
A, LENGTH: — 4 atr.mc acids 
;BJ TYPE: ammo acia 
(C) STRANDEDNESS: single 
- -'^ w>u_/Lj : ; linear 

MOLECULE TYPE: protein 
^ SEQUENCE DESCRIPTION: SEQ ID NC : 2 3 6 
? ^ G ; V Thr A3:: ^ Ser -Y Gin Leu Ala Leu Pro 



Pro a ~, 



• vrq Ser Yal Pro 



15 



Civ GIu Pro Leu Gly 



Ala Ala ^r: ~ ■ , , • - - 

7 ^ J " ?he Agd Asd Arc; Asn 

^ 4 0 " 



Clu Art: -- a : - _ . 

5 C """ ~ — J ^ a Clu A.j Leu Pro 

--r: Leu Pur Pro • - , , , 

-j 3 -, Q ^ ° e - - — -eu .-vrg 

^ ^ - ^ ^ Ar, val , ; aL , eu G I v Yal Z 

^ _ , " ^3 



31u Leu Leu Pre Pr.^ v-- ' ** " 

" " — : --^ -iy Civ Pro 

"° r Ala Va: Val Th: ALi .\r~: s-r L-u 

16 5 . ., „ 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE : protein 

(XI) SEQUENCE DESCRIPTION: SEQ TD NO:287: 

val Gly val val G l y v al sly Wa Thr ^ prQ ^ ^ ^ ^ ^ 

«/ Ala CI, Ser Ma Gly Thr Giy ^ ^ ^ Qly ^ £ 

^ 2 5 

Lys Glv Arq He Asn , ^ - !\ 30 

- ^9 Asp Se. Axa Ser Aia Leu Ma Ala Pro Leu Ser Thr 

-V Leu Leu Ala Val Pro Ser Hie Thr Thr Asn Gin ^ 



55 60 



N'C' : 2 a a : 



(2) IITFOPMATICM POP SLQ 

:ii sequence characteristics: 

^A) LENGTH: 133 ammo ac^ac 

(3) TYPE: ammo acid 

(C) STRANDEDNESS : singlo 

O) TOPOLOGY: linear 

(ii) MOLECULE 7Y?E: protein 

' x: ' SEQUENCE CECCR^^om ■ c—- , T ,^ 

^>-« ^0 MO : 2 38 . 

Ala Asn Th- 01" q ^ - ^ ^ - 

^ 7ai 1 L - u A ~9 Hig Gly Glu Ser Asp 

~rr; Ala Leu Asn Leu ^« >- — , 15 

zo "~ Asp Yal Civ Leu Thr 

^ - /S 31 < ^ A - ^ Val Jl; ^ ^ y j? , 

... ^ 4c ' " ' ' Jxa :r G1;; 

5C " ' ' ' ^ 1 7 " r ~ eu Arg Ala 

^ Thr Thr Ala Hi:; Leu Va - ~ , °° 

r.b *'^ p Aia -eu Trp He 



3 5 

leu Asd Lv: 



i> ^ r Cj ^ n Ph* 1 



^ a 



II J" EC RMAlT I 0 N 



^ for seq :o :;o:ib^ 



WO 99/42076 



l > CTa'S99 0326S 



240 



fxi) SEQUENCE DESCRIPTION : SEg ZD NO:289: 

Pro Gly Ser Phe Ala Arg Thx Lys Pro Pro Gly Arc Thr Ala Asp Ala 

5 10 15 

Pro He Arg Cys Arg Asp Ser Arg Gly Thr Ala Gly His Arg Ala Leu 

20 25 30 

Asp Glu Pro Pro Pro Arg Gly Ser Glu Pro Ala Arg Arg Arg Ser Arg 

35 40 45 

Gly Vai Arg Thr Val Val His Asp Ser Leu Ala Ala Arg Arg Val 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 2 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 amino srHs 

(B) TYPE : ammo acid 

1 C ) S TRAND EDNE S3: single 
ID) TOPOLOGY: linear 

MOLECULE TYPE: protein 

X1 SEQUENCE DESCRIPTION: SEQ ID "0:290: 

31 -3 Glv Glv 3 *, Va — - V ^ , 

* %J ~- r ^y S^y Ala 31'/ Glv 

5 1G 15* 

rt 3 P _v G.y Gin Sly sly Ala Gly Arg Glv Leu Trr, Glv T w G 1 " G-' 

Ala Glv Glv r • ^ ^ ; 5 ' ^ " " 

^ a ^ r ^ V Arg Hl3 '"rp Ara Arg Trp Ala Asp 

' " -5 C 



INFORMATION FOR 31 
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?ro lie Pro Val Arg Ala Ala His His Glu 



7 0 



(2 J INFORMATION FOR SEQ ID NC:292: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 174 base oai-s 

(B) TYPE: nucleic acid 
fC) STRAND ED NESS . single 
(D) TOPOLOGY: linear 

(u) MCLSCULE TYPE: cDNA 

Cxi; SEQUENCE DESCRIPTION: SEQ ID NO: 292: 

-^GCACGTAA CACCG^G Al m *• r* — ^- ~ 

^ Ji ^^^^ -^AmGvtvj^GC CCCTCGTOAT - ^^^^^^^ 

-GAAL'jGTTA TTCAC^v^^"'^ „ "* T ' "* * — <-^*.x^v-oT 

~r.~^ „ ^ ,-, _ , . _ -^^^^vjLLrtL -AMU . OOTGA ~ 

- - ■ ^ _ JV3\ [-1 -if , — ^ . - 



30 



OACAx.CG ACAGCA3GGC CGCG * . 



NO: 192 



:2; INFORMATION FOR 3EQ 

( l ) S E Q UENC E CHARACTER I "TICS ■ 
(A) EENSTH: 404 base na- s 
;' B) 'lYPE : nucleic acid 

^ TOPOLOGY: linear *" * 

MCLSO-LZ TYPE: cDNA 

;X1 3E «™-2 -ESCRIPTIIN. SEQ ID NO : 2 y i 



-GCAAACGG 3C 
CTOGGCAC CATS D"G I'D" JAAAGCI 



--''----n-- jj.-^j"73 ' """" ^ ^; * — ^ 

^ ^ ^^^^ \_ ^ - j i j i 

^^r,^, „ 00^"^ ~~ ~ " J ^ j:AU " J EGGGT .3 3'TGG t'^q-^q'-/-.,-.™ 

_ _ _ ^ 7 - -'-CO 33 3 3 A CO TOG AO A™ — — — — , , ^ _ _ * ''~ 5 " 

. CC3ACCGC ACCC~G'""' , G • — ^ ~ " * * J y J ^ lL ^u-iG OA C 3 34 f,- 

»\CGTACC3 GGAGCTGGa" ™™f™ " VaGT ""' j - g - - GATGGTCAGC 
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LeU VaI GIu Val Leu Asd 3iv Al. 

35 * 40 

LeU ^ Y ^9 Val Leu ,Ua Val 



65 



70 



85 
Ala 

100 



90 
i Ala 

10Z 



Th.r 


Met 


Ala 
45 


Pro 




Ser 


Arg 


Leu 
60 


Val 


Ser 


Arg 


Trp 


Leu 
75 

Thr 


Asp 
Ser 


Gly 
Ala 


Glu 
Pro 


His 
Ala 


Asp 

80 

Val 


Gin 


Pro 


Asp 


Ser 


95 
Val 


Ala 



Val Ser Trp Ala Asp Gly Gin Leu ; HO 

H5 ^ 0 ^ ^9 Asp Ma Leu 

Ala Asp Arg Leu Ala Thr- *~ 125 

130 

INFOPi^TlCN F^P — ^ 



.A; LZNG7H: 325 base oai 
[3 > nuclei . iL -id 

<G) STRATOSDNSSS : . jinglp 

(D) TOPOLOGY: linear 

MCLSGULE TYPE: jONA 

3AGGG 



:gagga gctggcacaa gaggg^gtgg agag~— — ^ CTCC:GGC ^ c - a gttcg 

GGGAAGATG j'TTG" ^" - — ~ ^ ~ „ , ^ * A * ° " * w ^-^GGGTGTGG 



GGCA GG7GGGGG ~G \A— — ^IIlT'""^ rTTCA ^ C ^ 



i^~ormat::g; 



P^A\T EG :;e S 3 - ; r. cj ' . 
i'-^^j:' t ;;:ear 



60 



J - - - -GAGGG GTAGGAGTAG GTGT— 

:gaagaagac cacgaaggg- l"' 7 ^ ~ " TGC GGGc -~CTAG gacgacaagg 

TGGGGAAACE ~^G~ V — ~ — """^ -^--^GG^TG AAATACTT: 

^=~g~c~ la^fS :::Z1:~ C . iso 



^AAGG J 0 r 



' J " J ' - - - GGG7CGG -ACAAGAAG^ — - - ^ _ ^ „ _ ^. ^ v "* kvJVJ ' lv -'^- - --TAGGAC ~ 12 C 

GCGGGGGC GACGTGAGCG GAGA— — - ^1?!^° AC " AC SGGAG CTGA7GGGTG 



AGATGCGG" — * — — — 180 

-GGCAG 



5 2 6 



MC LEGGED 
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^G^GATAGG ACAGGGCGAG CAAGCCACCG GCAACG~ — — 

AGACCGTACT GCACCAACCT GAAGAGCTGA A^~c~ G 
AATTGGG ^CT^C, ^ACGTGCAAC AGCTGCGAAC 

(2) INFORMATION FOR SEQ ID NO: 297: 

CD SEQUENCE CHARACTERISTICS • 
(A) LENGTH: 523 base pairs ' 
(B; TYPE: nucleic acid 
fC) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

di) MOLECULE TYPE: cDNA 

fxi) SEQUENCE DESCRIPTION: SEQ ID NO: 297 : 



ACGAAGCGCG AGAATA'^GAG . ^ 



- ■ — ^ j 



--VJ.GTA CGAGCTTGAG 



^---j- CGCCGTGGT" ~GGTG~""r ^ m 
'GG CCATGCGAC" — — ~~ GAAGGTTTGT 



: = INFORMATION ?OR 3EQ ID NO : 2 58 : 

^ SEQUENCE CI-IARACTERISTICC 
~; -cNGTII. 510 ;^ase ca:^ 
..^^e:r acid 



C STTIALTELNES 



; ;ngie 
1: ::«ar 



480 
48^ 



:GCCCAC - CAAGGCGGCC ctggacaca 
TTTCAA GACCGA-A- — — ^ACTGGACTA CCGCTCGCGG CGGCCATTAA ^0 

C3ACAG CAT~GGC^CC C"^~^^ — ATGATCC TGAGCTAAGC CTGTATGCGC 
^CGCIT CATIACCGC- ;f TGGC — TTTGGAGCCG GACCTGAAGI 



^AGCGCTT CATIACCGCC ™- GA^ G~ -v~~Z« ** i ^ b '- G GACCTGAAGI 360 

IGGCCTOG GCA — r-^^^Z ^ U ^ GAGCG CCTGGGTGTA CGGCAGAACC 4 -n 

A ~™_,_ — -^C-C ^ACCGATCAC GATGACCGCT 40 D 

A^_„^_ TTCAACC3TT CGAT-— 1 
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(C) STRANDEDNESS : single 
!0! TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 299: 
Phe Asp Gly ^ « u ^ Leu ^ g ^ ^ ^ ^ 

A-P Asp Lys Ma Lys Lys Thr Thr Lys £ ^ ^ £ ^ 

V.1 Gly V 3i Lys IVr L . u ^ Leu 61y Ala Gly Glu £ ^ ^ 

G:y Z p sor ^ ^ ^ " v Aan Glu ?h * Le ^ ^ e 

Ala Gin Gin Ala Vai Glu Th- ' eu A ~n - fu 

' _ ~ ' a ~ J — 

:hr val ^ ;; 5 ^ - ^ - H, 3 ,. /a Phe !; r 

Ii^ 31 y Lys Glu Tvr Ar- v v - - ~, 95 

loc * - T >~ Thr Val Leu His 

H " Thir A - ^ ^ ^ Asp Lys Arg L eu Val p^o 

1 "> Q 

Val Thr Pro Val S^- CT - ^ -T 12S 

13" ' -^3 Asp Pre: Cys Tvr Lou 

3-y A:;: H: 3 Asr; Lvc Val - 

145 * 150 ^ J — D -eu lie Gly Al a 

•-a - a 3 ^ */ A.I. a Th T * ' -60 



' 1 : : :;fo rmat : c : ; y o r 3 e ~ ■ 

*■ -r-ilGT}* : iui ammo aci^ 



amine acic 



3TRANDEDrJES3 

line a 



^ 1 na I e 

OLOGY 
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100 :os 

Leu Thr Gin Ara As- riv n , 110 

J — r Gly Ala Ai-i He niv m~ ~n 

115 12Q * 31 :i Gin Ala 

Thr Gly Asa Ala Glv His He AIa r 1 -, T 125 

130 " l3s ^ a HlS Leu ^ Val Leu His 

Gin Pro Glu Glu Leu Asn Thr Arg Arq Thr rv a" 

145 9 ^ Thr °/ s Asn 2er Cys Glu Gin 

Leu ' LJO 160 



(2) INFORMATION FOR SEQ ID NO: 301: 

(1) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 17 5 amiao 

(B) TYPE: amino acid 

(CI 3TRANDSDNESS : smal--. 
>D! TOPOLOGY; linear 

i-D MOLECULE TYPE: procem 

SEANCE ^SCP.TPTIC::: I2 KO:30I. 

J " A. a Arg Glu Tyr 3lu ?r;j G 1 " :T - n r . 
1 5 ° — ; r - Tyr Glu Leu Glu 

?he Pro Ala ~ „ 10 

2o ^. * - - Pro Val Leu 

7al H - -v ?he Ser A^ a ' i v_ ™ 

35 #n - — - ..^ ^ a Arg Leu 

„ - . « r 

.-^a .^a a^<i Ku; Le- * - ^ ^ " 

5 0 ~„ - '^al A. a Ser 

Ph'2 Ala ' 1 - t< ,„ - • . , . ' J ^ 

■j 5 . . .^ r - . , 3r 

"*~° - J ^' 2 - nr - ; " ; ^r Asc As- ;-r~ 



la Lea \r~r 



^ j ~ .■ ..... ;'rc r*ne 



9 5 

eu Leu Aid 
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;xij SEQUENCE DESCRIPTION: GEO ID NC:302. 

Lys Pro Val Lys Glu Pro Val Pro Ala Leu Pro Pro Val Pro Pro Thr 

15 10 is 

Pro Ala Leu Pro Pro Leu Pro Pro Leu Pro Pro Val Pro Gly Phe Pro 

20 25 30 

Thr Val Pro Pro Pro Gly Ser Met Ala Pro Leu Phe Arg Pro Phe Ser 

35 40 45 

Pro Ala Pro Pro Ser Pre Ala Leu Pro Pro Ser Pro Pr^ Leu Pre Pro 

60 



50 55 



Leu Val Gly Val Ala Ala 
65 70 



Trp Leu Thr Tyr Cys Ser Thr Gly Pro Ala 



75 80 
Leu Asp Pro Leu Ala Val Ser lie Ala Ala Ser Met Asp Pro Pro Thr 

85 y 0 95 

Thr Thr Cys Glu Ala Ser Pro Ala Ala Ala Ala Ala Gin Leu Cvs Arg 

^00 :os I1Q 

J^y je; Cys Asp Leu Ala Pro .Ala Asp Glu Met Met Gly Thr Thr Glv 

T 5 120 —5 

Ala 1/5 Gly Arg Leu Gly Glu Ala Ser Ala Glv Ger Arg Ser Arg His 

130 :3S i40 

Thr Arg Arg Cys Ala Ala Ala Ser Glu I ] -? cvd An Lou Arf Cvs Th- 

: 45 150 155 " ^ 160 

3 Ar-; Asp Trp Va] Ser ?rn Leu Ala Pre 

1-0 i-s 



.-vrg aer Ser jer Gly Va 
15 5 



(2. INFORMATION FOR GEO ID NO: 303: 

SEQUENCE CHARACTER IG TICS : 
Aj LENGTH: 32 1 case oairs 
3. 1 TYPE: nucleic acid 
C : " TRANDEDME S S : ^ir.aio 

. L TOPOLOGY: linear 

iG MOLECULE TYPE: Genomic ON A 

o jequence description: seg :o :;c:3gi 



.J >_>*._ rtuv. - _ '.J 

jv-A^-jCTA 1 
. GGGTGAAT 



. ^3 ^ -uJ TACGAAAAGT 



■vj'. — ru-v^.j.TOi - - - j - jGAG 1 j G j- uVv TT A T C AGGGCTCATT 

" " - - - ~ Jf -' • - -■•uv . - _Art_ 20IT0NACAA GCTTGCGGCI 

jATTGAGTTT AAO C G 0 TN Ar; IAATAACTAG GATAAC C OCT 

. -'rtrt.j^.j . . , 11 IGGTGAAA GGANGAACTA TATCCGGATA 

-•^-^GCGAT'G 1CGTTC0CAA GAG T T G C G CA CCKGAATGGC 

tSoattaac:; ggggggtgtn jgkgttaccg ggacgtnaco 
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2 INFORMATION FOR SEQ ID NO: 304: 

f i : SEQUENCE CHARACTERISTICS : 
fA; LENGTH : 10 82 base pairs 
(3) TYPE: nucleic acid 

(C) STPANDEDNES5 : single 

(D) TOPOLOGY: linear 

(i-i MOLECULE TYPE: Genomic DNA 

(XI, SEQUENCE DESCRIPTION: SEQ ID NO: 3 04 



ACcSSJ ^CGCAGCCG 3C3GGACCGT C3CCAGCACC 

acoco^ ^ G ^:: 5:M GGTGA I - 



120 

•I'GCCT CGAACANGCC — — OATGGCCGCG 130 
GAAACCCACA C3CC^-G : ^'^^Z ^ G "" AGC NCCAA "CCGCACCCA : 4 0 

CCATCGC^JA ":fu T r^T^« — - ~ - - ' ~ ° — -^--uua,, OGANATCCC" 

jCtgccggaa cccccc-gc- — 

GC3GGGCAAC C3CGAACCCA N'GAACAC— - — ~™ * ~=TGG 360 

TAAACG CTTCACATCC A^^^^ ^GCAACAG CAATT3TCAA 4 20 

4 a o 

-"JGGGANCCC CCCGTGSCMM NGTC-CGG^ T^^": ^:^ G ? A ™AAA7CAGC 6 00 

66 0 

~~^AA ; iq 



__ >T _ rtWllLy - — CGCCACA CCGTCGGMTC n5~AG~cr» 

— - ~ ^ * N GvjGCGGNCA" — ^-Hij.^LO/i 

iTGSAAGGN CCMACSfflT "■ ^ CAGKCTAGGT C77CGGCCGA 

5GGGANCCC C"™GS^ .'^^^ > AAAAAAST - TCCCANTGGA 7AAAATCAGC 

-CAAC7X3C CaIa.^AA- ^"--^ ATT2STTCaAC MGGTTTMAC3 3CGGK73CMG 

AAAACCGGNC ^rZZZKG^ 7AMMACCGG" '—IZ^ ^f^fx* ;,GC ""~~ 

AMCCY""- 3-; mr— .—r, -...„„„_ ' ""'^ --^K.^K.CC CANGl.TYAAC 

AAAC-SGKVG W^™. -~ " "3IJGGGGTTC SiTKGTTSCYT AWMCCCCCGG 

CCT-GGGAAW ~C.- - ' ^^l" ^GGCCAN KNRAAWGKYT 



34 0 
90 0 



Avw-rrccNc gwttcawtco igt^a^ :;:^ tGS3CN f^™ TYN ^gggaacs geo 
:~3T':nanan aaajat**^ - -^^V^?™"! ^r_"*lL: '-'cgssc stccmggsnc 1020 

- "-MGP.RNKP. 3MG 



>^J-r-w^. .„ ^ ;-. 10 BO 

1032 
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CCCYAACMAW ACTWMMNACC CNNgSS AA^™ CGGGKCAAKA WAATTTAAA 
ANTCGGAAMC CGGNTS TACC I^cS AQGrrTTSCT NAAAGAAASA 

= ™ ~ s™ ssss 

=5 ™ = ~ SS= 



660 
720 
780 
840 
900 
960 
990 



C2J INFORMATION FOR SEQ ID NO: 306 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 223 base pairs' 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 
;D) TOPOLOGY: linear 

in) MOLECULE TYPE: Genomic ON A 

^ SEQUENCE DESCRIPTION: SEQ 10 :; O:3 06: 

-o^'GTC GCAACGCGG3 C"TG"T""T^^ * ^ 

G^GGC i^r^- — ~ '^^--^-^ CCGGTGTTGC CGGTGGGG"^ rr 

^ «C SCGGGCTGGT TTGGTCATGG GCOC.C^C , 

CC3C~ C^— «C G 180 

--oj. ^ ^.^7 1 ^ CCG n 

INFORMATION FOR SEQ :l N'C ■ 3 C 7 ■ 

SEQUENCE CHARACTERISTICS- 
LENGTH: 4 : ; - ^^^o 
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(X- SEOL-ENCE SESCSIPTZON: 3EQ ID NO:308: 



^cSSS SSSS — « 

TACCGGCCAC CACC3CGGGC AT^r— — GGCTGTC CGGCTTGASC ACCACCGCGT 

SSS S™ « SF SESSS SES 

= ™ ~ iii — — 

GCSGATGACT GCAGCTCCC^ CGA^— Tr^T ^ ATNAACAGGT CGCGATAGCG 
CTTCCGCGAA TGCCGC^CG SScGC^ JS?"* GTCGGTCTGC GCCGCGCGAN 

AAC3ANNCAA AC=tS££ SSSS "^^^ ^CAGGTKCC 

^MAATTTCGC CKSAT^S ^ YCAAAAA ™ C GGKTTTTGGN 

GCAAAMGGKT rTGGK^CC ^"^TO NTCNGKGGAK 

3GGAKXGNGA ATYcSnA VC^GG^" -SS^ "CCCCCCGG 

"^■^ "aaaaaacsc cccaaattmm — — .™ r^™-^ 



-CCSNCCCM AA^KG-; I^SgC ^ SNC * SNG ^^CNRCC^ 



c 



VTI ON FOR SEC ID NO : 30 9: 



■ i, v SEQUENCE CHARACTERISTICS - 
(A; LENGTH: IJ36 base 2a -^ 
'B! TYPE: nucleic acid 
(CI 57RANDESNESS: sin;.. 
.'Li TOPOLOGY: linear 

MOLECULE TYPE- Jenc^: CNA 
X1 3HQTJEI;CE ^CRTPTION: 3EQ ID NO: 309: 
"AAGAKTCA GCGTCAGCCG ^^^l' ^ flll^ ~™GT3C~ TGCCCTTACC 



rGGGG TGGAATAACT GAANG7TA 



~NS* 

. CON 033 AAN 



" — - - " — - - S AANTTTO t ■ ' : ' ■ ; - ^ - 
_ J - ' J - — -J o ^. ^ . 1 

" 7 "" - J - J ^~C3AY .'TTSATGGCCC NACNr"WC"G 

.-w \ . 1: AMP v ; - _ 

TNSY'' — ' , . - ---.-IGGAAAA ACCAACMSGC 

- ----- - - • - .-u-^n ^ 3 I p . GTR5;TM r ' rCSr^AAA^" n-r-.. 

^^-..^ :r,ir,;^3r ;3:f;^:? K — co~-tnaam c; 

::T^rTMCTKc ::^v™r^ ^^^1^^ "'- : '™:-'.G3.v tncccaatnt wcmwcxccns 
::g:;cwkawta mga-:ct^ ^ff^'^ r,,rrATAAAA -- wot/a wcwx 

':i--rrcT>ttiAr — t:-:tamwcc ::cmccckcsw twc^'ckcsc- 

*n -CTT>TTCC; '"-'\ t a: .""■■■"•-* - .... " ^^^r.TWrwr 



:c2o 
:0 4 9 



,- T 3 A 3GCCAACCGG GAAAAGGGTG 12 

-u^. .GTGGG ^A~— ^ AC C C 3 G TATT GATATGTATT 13: 

"GTYPTGGA ZPS^.JG^ CAGC^GCG^ ^r;?!!^ A TTTGGGG AA 24 , 
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(A; LENGTH. 1036 ba.se pairs 

<3i TYPE . nuclei;; acid' 

\C) STRAND ED NHS 5 ; single 

(D) TOPOLOGY; hnear 

(n) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 3 10: 



GGCTCTTCGC ACGTGAGGTA ACCAATAACT CCGACGCAG; 

GGGTACCAGG CTCCGCCSGA GCCAGCCGTT GTGCCCCCZC- 

GCGATCGAAG TAAGAAAC CG CCCCATGCCC GTCGCCAAGT 

ACGATCGT7G TCCTTTCCGT GGGGGTAATC GAN^TAnr* 

ZI^ G ^ TT:G ~ cactgacc -ac-aacggg CTGTGCGACA 

TT77GCGGGG 77GCNAACGG AATGANCG3G ACGCGCTCGC 
'^^rr^fZ -GNNTCTJCG 7CGAGATTT7 GGGSTTMTGC 
GCCAA..CYG AA CN AAAAAA TTGGYCCATY ARNGTYCTCM 

^^ GRCCC: — ^V,AAACGG CCCWWAANCC 
- ----- - ^ ^ 7 3 GC7CNCC3GG TT 



GAAGTGGTCG 


GCGATCTCGC 


SO 


GATGCGGACA 


CCTGCGATGT 


120 


CTTTGCTAGC 


AGGGTGTCTT 


130 


CCAACTCCGG 


CCCTCGATCC 


240 


GGCCGAAGGT 


AGCTGCTGT 


300 


ACGACTGACC 


GAGCAAACGA 


360 












i 0 




ATTGG TG GT G 


■:ao- 


7GAAS7ANCC 






GCCTCNGCAA 


CSSNAAYNC7 


5 0 0 


CCAAAAACCN 


AWTCC77KTA 


6 6 0 




jru^i^j. ,, r ■ , ~ ^ J N C : 3 1 1 

£ecuence g:iafj^gtek:.-ticg : 

A LENGTH. 106C pairs 
-3 TRANE EDNE3 ~ 



78 0 
84 0 



™ T „ _ TCMCM GGSCMMTNWN GGGNTGCSCC CC CNCNAAAA 

: : ;r?^ AAA c "~^ asktgggssc cggmarccgg GGKAAKKWWA 

hNT.m^NCCN KAAAAAAAWW NCANNMC~C~ Wr^rMr-— ™^™,^^, „ 

ARAAAATM T 7 CANA^TS^ "^w^ r"^^ GmT ™^ 90C 

K ^ . SWAKCC G G G NNJ JKKNN CCAAWKAAR? ^6 0 

iCCGCCN NNTWK G G TT7 J :o?C 



1036 
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awnccccmck sa^ggggrny ttmka ggg m g 960 

^NV__C^ ^-uuouGHA NAAYAAAAGK NG3NG2GAAT NTT?rrrr^ RSSSRNv— , n ! 

tyntcctycn ccnmgnrwwg sramntgkts ns.gggsggc asss**... 1020 



(2; INFORMATION FOR SEQ ID NO: 3 12: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 104 0 base pairs 
CBj TYPE: nucleic acid 
(C) 3TRANDEDNESS: single 
(O) TOPOLOGY: linear 

ill, MOLECGLE TYPE: Genomic DNA 

l xi) SEQUENCE DESCRIPTION: SEQ 70 NO: 31- 



; C AT CAT 'GO AT G TGCGGCTAAC GAANAAGTTA TGACATGGCG 
■oou\_rt* 3CNC GCGGG AMT^"""' , ^ ^-^^^^ 

_„ „ ____ 1a ' ~ -^^irtC^oC .GTGTNTGAA GCGTMTCAAC 

^~ 3C3T -^ttmmaac cnaac=cn™ cnaty—c 

ji .... ^1 --^-^^^ TGAAACCCMA CTNCCCCCC: 



G i _ CGACTTA MRTNTTCAAA AAMCGGMTNA GAA~*n ~™ llltT^"^ 
„ -»^-.J.< ^A^__!.C^P TCAAANTAMK 



:amggc ncmtnmttgm ggggcsnnas g^cccgggkk 



TCN_MoI'7TT ..KGGGGGGG GGCYGRTRMC NCGGGCCCCC GG C C CKKMAA 
' G G G G?< 
*AKG GT 
?K?vGGM 

ngwmg:: :gg:;nga:jp.ci; :;cnggg:;k?:; 



--sgtma^ mkgmannnct sgmscz^s 

~<j**i>.^ARN AAKAM5CMG GM G/J G 3 \ T 



INF GP.MA7IGN 



1060 



AA.-C3GCAC GAGG _ _ CAGC AAAGAG GTGA CATGGCGGGT iATGCGACV — c ^_._ 

jCAatacggg gatggatgan gggaanggak tctgg~~— *o 



--o oC-AAAGATG cgacggttaa g-tgcgc— <- r — 

TGTTNGTATG GATGC-GGaa CCGCGC-GAC w*C^.ow^ 180 

_____ ^^-*G rt C TTCGCTGGTG GCCGGGCACN 7 40 

^„:.'_^:;: ^^-sgtta aattgcstgt gcatcatctg gcaggctatg 300 

^--^ "::^r^:Z f ATG ^GGCTAAC GAANAAGTTA TGACATGGCG 360 

420 
480 
540 

GA/CrrCGGGG —^GGGNR— — ^J:::," '^77^ - — ^ : -^ANTAMM 600 

saaacntttg ~^-:~^t „ ~„ ^knnt\_j ^ttynwc seo 



72 C 



wc- „^ rv .„ — a-*^ GGCCCKKMAA 780 

-j^^^.^w^^^ :r-,^^ ACCCCAANRA 840 

9 0 0 

G3M5 gsc?:sc2tng^; 



960 
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(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH; 3 32 base pairs 
EB) TYPE: nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 314: 



AATTCGGCAC GAGAGACCGG GTCGTTGACC AACGGACGCT TGGGCGCGGG CCCCTTGCGT 6 0 

GGCATCAGCC CTTCTCCTTC TTAGCGCCGT AACGGCTGCG TGCCTGTTTG CGGTTCTTGA 12 0 

180 
240 

3 n n 

332 



CACCCTGCGT ATCCAGCGAA CCGCCGATGA TCTTGTAGCG CACACCAGGC AGGTCCTTCA 
CCCGSCCGCC GCGCACCAGC AC CATC G AG T GCTCCTGCAG GTTGTGGCCC TCGCCGGGAA 
TGTACGCCGT GACCTCGAAC T^AOTOGTPA r^rATG^GG arzzrr 
?TTC SGCTTO TTCGGAGTGG tggc 



,2) INFORMATICS EOP SEQ ID MO: 3 15: 

' : 1 SEQUENCE CHARACTERISTICS: 
:'A; LENGTH: 962 base pairs 
(31 TYPE: nucleic acid 
:'C: STRANDEDNES3 : single 
! D) TOPOLOGY: linear 





MOLECULE T 


- FE . Genomic 


DNA 










SEQUENCE DESCRIPTION: 


SEQ ID NO : 3 


15 : 






■ — .jGCAC 


RAGTCGGTCT 




AATGCTCCCG 


CGAGCACCTC 


GCCACTGCAC 


■5 C 




AAAATGTGCT 


> — "U-V X .J X »JW - 


AAC G AG C C C T 


TCCAGACGCT 


w^-\C ^_GGCCGC 




" 23CTGATC0 


■j\-AAC jGCj2 


■~. vA^_ GGGACT 


_ «_ ~ 1 ju/iA'_ ^ 0 


GGG CTGACOC 


G C C GG C G 


180 


j j * G'oCTCTT 




^ AAC GG C C 




GAACGGAACC 


/w\CGGCGGGG 


24 C 




JLji w JVJ» ^ -3 


GG AT TT CTT C 


uCa-CGGj gc 


ACCGGCGGGG 


J O ^ o „ GT 


300 


~ j> — Av — Aja_ Gvj 


' — AC 0 C G*j 


ONGCGC 


CCGTNGGGCG 




jG'jCTCC SGC 


36 0 




• — AC o 'j w oG-~ j 


- — — -jGCTCAC 


~ CN'j TT G o 


j AC GC GGGG A 


CC2GTNACCC 


42 C 






vjvaAA C C G C GG 




ACATTAKACC 


COGCGGNACC 


4 a C 




J GG AA 2 GG N G 




AAC GG EGGG c 


ICGOGGAACC 


GMMGGSTSTT 






mGGNl 2AAi"-T 






/-vT'JG' jKTGAMC 


2PSATGSNCA 


r> 0 i. 


" * V TTMA G j AA 


iytncicant 




- .!! 


AGRAWNKNG7 




s , 






- v/Vfvn- . ' TV 


- - ■ » - j ^. . 


cicccngm 


G P. 3 * * CI INT? J * - 






TCCGG^NAAM 






;ggtkmnnna 


'_ _.. , _. ._ AAA_V 


r 


:<NTLNNKC3CA 


ANGGGKSNGC 


irKirNMMNSGT 


TTTVCKNMRA 


mrnwtyknkn 


NTCNGARSPN 


3 4 0 


naa>i cnns rrr. 


JIG rUCOTNKAA 


ARN'mTWKTN 


knscnnncnn 


GRRNGVRGGC 


CIOMKGSNMNG 


900 


'.'*C '.vKI 7 A W P T I G 


nngsncnckc 


. J . i r-MN^vAAAA 


/^wi '.j j'. N^;\>; 


:;smknekkt:c; 


NRGGGGGGGO 
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(ii) MOLECULE TYPE. Genomic DNA 

(xij SEQUENCE DESCRIPTION: SEQ ID NO : 3 1 6 : 

AATTCGGCAC RAGAAGACGC CCGAANGTTT GCGCTGGCTC TACAACTTCA TCAARGCGCA 
GGGGGAACGC AACTTCGGCA AGATCTACGT TCGCTTCCCC GAAGCGGTCT CGATGCGCCA 
GTACCTCGGC GCACCGCACG GCGAGCTGAC CCAGGATCCG GCCGCGAAAC GGCTTGCGT 
GCAGAAGATG TCCITCGAGG TGGCCTGGAG GATTTTGCAN GCGACGCCNG TGACCGCGAC 
GGG TT77KGTG TCCGCACTGC TGCTCACCAC CCGCGGCACC GCGTTGACCT CGACCAGCTG 
CACCACTCGT GCCGCTCGTG CCG 

ID) INFORMATION FOR SEQ ID NO: 317: 

(l) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1034 base oairs 
(3) TYPE: nucleic acia 
'C) .-jTRANTIEDNESS : 3inc;ie 
(D) TOPOLOGY: linear 

(ii: MOLECULE TYPF: Ger-:c DMA 

ixi; SEQUENCE DESCRIPTION: SE2. ID NO : 3 1 " . 

AA7TCGCAGT GTGTGTGG 73 GCG770AGAA 3AAGA7GATC G73AACATC3 CZAGCGCZGG 

I™^;^ 3 ;^ C :^ 3T3A 2™ ~-^~catc A7CGGCATAC AGCCGCCCGC 

■ ACZCZAC ACCAC37TCT GTGAIGTGCG 70 GCTTGAGC CAAAGCGTGT AGACRAACAC 

™^"" C3 AC3GT3AC::A 3G3CCAGCAC 77I73CCAGC AGGTTCGTGG CGCA( 

L CAGAAGAA I GAG AT CACCS T CNA 1 G T CA C 



"AGTGC7A ACGCG.CTCG GGTCGGCA: 



jcnaccagtt GAGCG.C.G 3C3C3GGCGG IGGCCATCAT 7373C73ACN ANCGTGTTG 
::^: GA ^ G C2GAT::AATG -GCIGCGGC 7CGTGCCGCT 7GTGC03AAT TCAACTCCG 
.^^.^ -GGNCGCACT 73AA7C3GG3 7GAATGAWTG AATTTAAAC C GSTSAACAN 



rt.C . ALATAj 



. 7TGAAN 



- ^ - - - v_ ~ jn 



■ .^AA.-V Nrt S AA * 



V-1AAR G G ! d' 7 7771 I A VTNR R ?7«* r ~ 
. 7 GNS T J . ; TT! JGT*:"ft AG GNP WKGNS 7 



1 M - ^-L G . H i : - a - o Da . 



SO 
120 
130 
240 

300 
323 



120 
180 

'CAT AG 24 0 

„ 300 

^ ^i: 73 ITT CATC A ~™ T3T33AT ATCGGCGTCG 360 

4 2 0 
4 80 
^40 

:^H>vrt.GGAAG d u C 

T>IV7AE7 ::AAAAGGCC7 7R7 77ATNG7 CCTCCACAGT tgu 



^^^^ — -™T^ ARNNCNGGCC 3KCCCTTCCX T!rTYCCC707 NTCCC7CNS7 ''8 0 

,W, J '^' --NmT-!Ni ..Tw* ACGGGGGGCC YTNGGGKCRM Tvn K7' TT 7CG GCCCCMCC: 



J4 I 

-* 0 ■_ 



WO 99/42076 



F , C"I7l. , SW03268 
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AATTCGGCAC GAGCCCACAT CCGGGGCZG'C TCGTTGCA^G ACTCG^C— — 

RAGGCACAGT CGCTGGAGCG CAATGTGTTG CTGACCGTGC TGTCCCGG^ ~t ^ 
TCCCGGGTCC TOTTGACCCA CGACATCGCG CAGCGCGACA I^CgE "° 
GACGGGTCGC CGCGGTGATC GAGAAGCTCA AAGGTCATCC GTTGTTCGC^ CACA~CA— 
TGCTGCGCAG TGAGCGCTCG CCGATCGCCG CGCTGGTCAC GAGATGCTCG ANGAGA~~AC 
CGGGCZZCZC TGAGTGCGCC TCCCGCGAGC A 



(2) INFORMATION FOR SEQ ID NO : 3 1 9 ; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1026 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



;:ci; SEQUENCE DESCRIPTION: SEQ i- :;0:iiy : 
.AATTCGGCAC GAGAT C G T CA CCCTGGCGAC CAGTGCACCC AGGCCAC3C: 



^g-^— amtncaggaa NTYCGT3TGG GCACCNGCTT CTTCCGKTRT 



:ytaaact tccnatsttn CSGCSGGC 

SMMAA ATCCCCANMC .AAA 



:tggcgttnc GNCCGGGCCG ntcttnccaa 



240 
300 
331 



.ctgatgggc cagaagatgg accacgtgct gcc : p— - ;:r.:;: LG 60 

r::™:: ^~ GC7AC GGCGA ""ctggtg ttcggcatca ccgctgacta ibo 

Z::::™ — = AGCAGCTGGT CAACGGTATC GAACTGGGTG TGGCGCGT,- ' 40 

^wu^u... ANCGACAATT CCGTSCTGCr* GT— AC" a~ *-„~„„ r „-^ " 

„_ rr .„„ ^-^CiAu^ ^T^^S.AA GCGTTCATCC 300 

^^CT^ .^NCG.C.C GCGGCSGGGG CGGCCCTCTG TGCCGACCGC -GAGC™ 

r:";:^;:: "S^™ ^ 3 -^cc= CGTGAGAAGG TGGGTCGTGC GCAAG™ 4.0 

_.o^<_ A.CN___ ^GGCCATGA CGCNGT3CTG TTCCACACCA CNTSNGACNC 480 

540 

600 
6 6 0 



~s c 



---o ju.;. * » jjGG GCSGGGNGGC GGCC y IAWNC'~' 

:rrr ^ JTCT - 3KrNc - j - ^csgg^cg nchaanscan cccttt-ggc 

-" CAV,T - - iAA ™AKCGSCN AAYCCCAAGY TMMGKCCYCY ^AAAAAAAA 

■"- Jfv :J ^ ,J -CCCAANTAA ATT~r~NGGr — - - ^^r^,..^-- — . „ 

tv , , " -----^jw^i^'o _~n<AN , ^rYMT TTTMCCSNSS 

- - ..j.nu«rtAfl_' "GGANCCGGG >LAAYTyMTV'; \*AA'-'~ " ■" ' 1 'Mim W -«m^ ~,,„ >T>T 

::^!^! ;^" : " SNG ;iAAAAANRSN , 60 

•'~ WN - S ' R - —sn-nnnn nnncmcsykc cskaankccc cgcggggggg :02 c 

' n ^ >' 



TRA2 «TJ E D N E S G 3 : ; - 
0 F G LOG V 1 :nea: 
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GCACCACTCG TG CCGCTGGT GCCG 

(2) INFORMATION FOR 3EQ ID NO: 321: 

SEQUENCE CHARACTERISTICS : 

(A) LENGTH: lOio base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xl) SEQUENCE DESCRIPTION: SEQ ID NO:321; 



^ ^~ - <j a .-vlj ^ C C C ; 

uvjrt-ji „ GGCATCG' 

ATAGTGG2CT CCAGAGTGC 



'*« ^ GGGA AAC 

AATTNTTCOG GAi> 
AG5TTTTGTT SYT 
MNAASCCNKT CGI 



GCTNAACACC 


ACCCCGCGGC 


GTTGCTCTCG 


TGACGGGGCG 


- 1 T G GAT G GAG 


- -GATGATGA 




w.-lG >j A C G C G T 


C G T G CAM TT G 


- * « j v_ o x vj CT C 




-wrjoATGAAT 




GCGCACAGTG 




CGCCCACNAT 


ACANCTTG5C 


AT L GG ATTTT 


NACTTTTCTT 


NNAWTAACTG 


CCACCTT3AA 


GGGGTTGTTG 


KCCGGGSTTT 


YSTNTTCCCC 


GAAAC C GAA G 


T TTNT i rTY^: n N 


camggsggtn 


AACCGKTMNG 


TSTKTCNNCG 


G GGCG KAAA W 


jGGG G G G 3 G A 


:^mTNAAAG 


3NMN3C3NGG 


GI-CfNSGGSNN 







60 
1 2 0 
13D 
~ 4 0 

300 



W-A.CGGCAAAT GC GIT Gat 

TA: ~-- C ~ GCATCGCCTG GGGGATGAAT GGGAACCGCA SGATGGCGAC 360 

J "~— - -ANCTCAGG7 TTGGGGr— r.r~.r »rii~v* — T ^^ :cc GGTACTCGGC 42Q 

GGGC ACNACAATCG 430 

\NCG CTCAANCCGT 54 0 

ICGK CCGTGGNGCA 60 0 

TACT GSTAACCCCG 55 0 

j NAN GGGCCGGCCA ^2 0 

WAA MYMTTTYCSG 73 0 

IAAA GGGSKNNKTG 34 0 

TYGN GTGKFJCNKS5 90 0 

rS-.'N GGTGKCCCNN 96 0 

"rGGG 1010 

IN FORMAT 

LENGTH : 1 J 1 - : - ;air; 
TYPE: nuclei: - tC — 
3 TRANDEDNES S . z ir.aic 
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escriptton : 




N3NGGGGWNS 


NT CAY CA YCA 


i CACSGGGY7J 


v . m „ TG CGG C 




AXG AG GG AMT 


CKCTTvGCNOO 


G ~TGTGCAAN 


CGACTCCACA 


AAAAACCGCT 


G TG TG T A YYT 


SCCORAAATR 


0 GG TX.TTY 


CCRATYCCCG 


TKTTGTAMCT 


-XGG GTS R 


— -jvjj-i ^ „ G/\A 






- ^S'jCjA 




CGGrC-'tTTGGN 


GCTG3NCGTG 


^j-kAATNCGR 


KY C 1"/ CTCC K 


YC 3GTACCCA 


AA Y CT G GG T A 


TG GT AT AC™ G 


G C T G Y CMMT K 


TTGCXGG3GT 


>- ^ N AA TTT A 3 


-AG GAG G GG T 


G K T G G G G V C G 


AGMCCGRAAA 


AAArLAAT AA T 


FJAXAAX G G T G 


OOONANTNCN 


ATCCGNTNCC 


MSCNCCCCCA 


G3GGTNAAGK 


7AAANCCCOA 


TAACCTTNCGR 




_ ^ _ CNCGGGG 




CGGGXWCCGT 


■j " — w _ CNAAA 


- G CAAAS T A 


ac^tcoggc:: 


CCT-TCCCSNAA 




GGGCAANCCO 




^_ ^ 




"C2-3NCCYTY 


3rr:g;aggma 




- —'"<ruu tA . " 




AGCCZVCCCO 


. . ^ ORAA 




: j .kantnccca 


C AAA CMAAAA 


AMCGGGC5Gr< 


-"".MA ^LGGGGN 


N T MCCCC3K"J\ 


AAMGCAMW2Y 


".GG -T'! AAAA 


j'-j/-tAGRAN CL T 


- - - ■ — i ANANM 




C3 







720 
780 
940 
900 
960 
1010 



^TlU: C " 3 GANTCGGTCN KCCGGGSTTT YSTNTTCCCC ACCTTNGNAN GGGCCGGCCA 
kGST^.GTT SYTGAAGGGG GAAAG GGAAC TTTNTYTYYN AACCSOtNAA WYT^TTTYC SG 
MKAASCGNKT CCCC7TTAAC CAMGGSGGTN AACCGKTMNG NGGKTAAAAA GGGSKNNKTG 
NCCCCYMANG GGGGGRAAAA TS TKTCNNCG GGGCGKAAAW ACCMMMMYGN GTGKKKNKSS 
GCSAAATTTT NMMRAACTKN GGGGCCSSGA NNTTTNAAAG MSCCCCCSNN GS7GKCCCNN 
NTTTCCNNAA WMKKGKNWNM SNMNSCSNGG GKYNSGGSNN NNAAGMGGGG 

(2) INFORMATION FOR SEQ ID NO;323: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 1092 base pairs 
(3) TYPE: nucleic acid' 

(C) STRAND EDNESS : single 

(D) TOPOLOGY. linear 

:n) MOLECULE TYPE: Genomic UNA 



CGCArtCTTGT MAAS AGATCT 6C 

CCAATRAGGT CTRATAATTY 12C 

AAGGCGCCG3 TNTCAACWTC 180 

AAAYCCCCGG TGTTGGAYCC 24 0 

TCCGGKWATT GAMSTCRC3G 3 0C 

AYCCRATAYC CCATGGCCTG 36 0 

GYCCCTAAAX GCAAWYCKGG 42 0 

TCCTTCCATA CCNAAACNCG 4 80 

GATNYGGAAA ACCNZCGCCN 54 0 

TKSGGAAYTT CTYMAACCCC 6 00 

3YCNWNCAAA ACASOMTTAT ^ b . j 

CTTTYTGGOT ~ ON A G A K AAA ' j 

C3AAACCTTR TCMNACONC:-: "3u 

TCAAAKKI-'G G 2 AK >r -vT JN G N G 3 4 c 

ON AAC AC 3*0-'. CYYTT C G 3 AAA -> 0 '"■ 

AAAAC-fYNGr; oGCCCCCCOC ^6C 

T C C CNYVTR 5 W 2CG3 WG M GN A ;. : 8 3 



3 TK ANT) EON r_ 3 3 : riq 1. - 
TOPOLOGY : . . ::e ^ r 
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TCCAGG CACCACGCCC A3-TGGTGG ACAATGCCCT GGCAJCTT-~ — PAMP-rrr 

c?SS^ £5^: " — ^ ^ -SgSS 

;: U - CJC " KTTTC^Gou GTNTCGGCAN AANCGCACCC WTGGWTTC* P M -CNCrr^Ar- 
GGGCGGACAA NTC3GGTTGC AATTTTGCRA AYCGGGGCCG GGATTCCSCA aS££- 
GAAACTGTTY YCJJAAMar-" ^sir^^,. * UiU AACuGGTGCX 



420 
480 



840 
900 
960 
1020 
1080 



GAAACTGT-Y vrPAAMir- Z ™ A1L ^ UCl GGATTC C S CA AACGGGTGCC 
j/iahlioUY YLRAAMAC CG GGAKCCGCAA "n^rr-rr^rn n im r. 

GCC GCCNNGG GGAAKAATWT MWWTOCNSGG GGGAAWTTTT NTSTGGAMCS 

S£S SET 00 ™ AYT " = SSSS 

TS^SS GK^NWNWAKR RAKGGYKNTT 
GNNGNNNAAA ACNK3HMMAC KcSScS C^C'™ MGNSGSYGGG 1140 

~ - — — m ^LTCCNCCC SCKNKNTGTC TMTCNMYGSG C " 
INFCRMATICN FCR 3EQ ID :;C:32 5: 

ii) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 1099 base pairs 

(B) TYPE: nucleic acid 

(C) STPANLEDNESS ■ single 
* E ; TOPOLOGY 1 linear 

i~! MOLE™-Z TfP" G^r.OT.i; DNA 

ixi; DESCRIPTION: 5EQ tD MO: 325: 

ttactacctg 50 

••— qAAG'—"-" - G ~~ ~~ -"-'" JV --•■ ,, - •--^--ouo. vjGATGGTGG C CTCGCICTGAK 

.^CAGGTGAC ~^r 3 ~3 -"^^ CC'™ 3 CCKCAGCAC ATCAKCGCCG . ^ 

TTGGC^ACY C™C~A ^^C~'" ^-I^ 7 L ' CAAAC — * -CMCTGCTG ; 4C 
: iy " LJnAC - -V.-.>G„MMA CCAKMAACXA NAACCGTTGT 

^ Z-^; ^gactcka ill ' 

GGGAAC5GS;: — ^m^" J^TACGG GCSGSCCHAT C AAATTC GCC 480 

J^'*^^' — . ^..caaaacc tycgaacggk atccttckcy v;^ 

„___ M ^,~Z Zl *^""~' v ~ - - ■ J,J ' J '' r v -*M.SG^0AATA "CCKNSCMNT CCGAATCCAA ' ^ ' 

,~ r ~™U;T! ~ .^^^ AAA - ;GGGv ~ —-agsnmkc ^coamnant 

, v -^„^^;! ' i: " C::1 ^ :;MAATSAJ ™~ tscyaccmw: 

^;:.;;::; f ;:"" ■■^■"^ka*- ^atoa:.ikwng ggnkck^oj :jaamaccsc>? ^ : 

r — <^<^- J "^^urt nrSGRMKMCT OTMT^CNCT 9 * 

r^!;: ^— :GG — : maama^a otakccgygn 9 qo 



3G::s—*r~s ::;::/ n :^ L jNMSK:rr::KC ?:5ctc ^™ ^onmkchn 

--y^^^ '— — ^^ -ac::cacac" mgwc-y™— *knnmkc\k^ loa :: 

l j 9 
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(C! STRANDEDNESS : single 
(D) TOPOLOGY, linear 

(11) MOLECULE TYPE. Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32 6: 



CCSAGGATCW 


ANTGCGGCCG 


MAAKCTWSTM 


CASAGATCTC 


60 


AKAKYSTCGT 


CCMRACCCGG 


CAYACWCCWG 


CNCGCCCCWT 


120 


ACCGTTGGCC 


CCGGCNCGCA 


CCTACACCAC 


CCACGCCGCC 


iao 


ACCCCGCKTT 


TACCGCCCGC 


GCCGCCGGGG 


CCACCACCAG 


24 0 


CCGCCGTTGC 


CAAAACAGGC 


CCGCKTTTGC 


CACCRA 


296 



(2) INFORMATION FOR SEQ. ID NO; 32 7: 

■I) SEQUENCE CHARACTERISTICS: 
-A; LENGIh; 1U7 3 case cairs 
!3; TYPE: nucleic .icid 
(c; STRANDEDNESS : single 
TOPOLOGY: linear 

(n; MOLECULE TYPE: Genomic DNA 

■xi! SEQUENCE DESCRIPTION: SEQ ID NO: 327 : 

ngngsgnx>ty atcatcwttc tgcaccsngg ktcwattgcg gcogcaatct tstmnasaga 
;:I;^; C otcamgar ca tctgcgcggn gaatgtccaa awgtcwktaa cggcmatcgg 

-..o^Gs^A ACCACKCTRT SCAXATOCGG GICAMWTYCA AACCRATTAT TTGGGYCGAG 

AAAATTTMCG CK* t, GT~lASC\ , _ „ , „ _ „ ..t- 

" ^"'^^ HU ' J -^- J ^.^AS^ CAGCCTCTRA ACCGTAAATv 

^Z^^: ^Z;™^ ASCYCSATAA TSCGGCCCGC AMCCACAAAA GCTGANTNGT 100 

- ~i + . „.JCRAA NCCoG^ YCCC GRAGGGGTSA ACTOC3GTAR ^CTTTiTTCWYC MC^TRA^A'* 

TAAACCCOCC G0GNTCNTC3 CCGC32CCAA A7YCYTGCCC >.TEGCNACCA YCCCJU^CTr* 

03GTATGGT3 RAAN CAS T 3 G SCRAA'CGGT^ MCC3TAC0:'C TGGCTGATYC 'TCGG™!^ iHU 

ATCMWCNCCG TACCTETTAA AATTCTTT ~ ' " ' " * — ^40 



ICCAMMGGGG CYCCGAAKXT CNACNTGGKT MACCCCTNCC YTTGAAS1 



V.n":™U\AAAA rwrrr-r: 



-AS ANACC3GAK:: CCCGGAAYI3 VTTAGGCYTCN TGCG003TTA AATTKGNCY 



- ■ — ^ 1 — ^- - * *■ * ■ - - - j :-'J .AAA 

•A/ v/wYrsTI G GG '-!M C ? TEA GN7 

J0G0RC3 N AA AA G Y GTI'GG 



TV 



WTCCC-UNKT? NS UN'S GNCG 
INFORMATION FOR 3E 



:^3i*jrtJis".T YNYCNOCAGA ANC 



SEQUENCE CHARACTER 1ST 
LENGTH: 116- ca. 

JTEANLEDNES.; 



60 
120 
180 
240 



"3 D vJ 

^2 0 

ILLCCUG GGT033 STMT TAAAMTTCrC GCCKSCASNG GAATY~YKSG -go 
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CCCGTTC TTMMMTTCAY TCATTCACCG GGMTCTAG7G CGGCCGCAAK CTTGTCXACA 



60 



GATCTCGAAY TCGGCAMGAS ACAATSTCGG GTKGGGCAAT GTCNGGTGGG GCAACTTTGG 12 0 



ISO 



GCTCGGRAAT YCGGGGTTAA C3CCGGGTCT RATGGGTSTG GGTAATATCG GGTTTGGTAA 

TGCCGGCAGC TACAATTTCG GTTTGGCAAA ATATGGGTGT GGG CAATATN GGGTYCGCTA 240 

ACACCGSCAS TGGRAATTYC GGTATTSGGT NACCGGTRAY AAYCTGACCG GGTNCGGTGG 300 

TTYCAATACC GGTAACGGGA ATGTSGG7TS YYYACYCCGS GSAACGGNWW YTTNGKTCCT 36 0 

TMMCNCTSSM CCKSAAMTSM KMGGTSTYCT MTYCNNGGAS TAMTYNMCCC C CG W A YCKSC 4 20 

WAYCCCTCGT CATYCCMCMC SGSGYCCTCA MNCCACCYTG NGYYCCCTCC MKMTCYCAYT 480 

CMNTCCGGTW CCTNTMMNCC CSCNCRYCTC AMCNCTKSGK CACOIATMYC CSACKCHTCT 540 

MCYMCSCAKN MTTCCCCTCN CCTYTNNCCA MCMC3 CTCTM TCMAACTCKC CCGGYCKCNC 6 00 

MYCTCTCKCC AYNMAACCKX TYCYWCNWYC YMYCKCKCAG WYXNMCTCCW ACTCTMYNTT 66 0 

TCTCTCNXCC CMKACCXNTT CTCWCSCCCC CCACAKAYMC YAWCKTMTCC MCTCKACSCC 72 0 

CYYQJNYCCM NMCWCMTCWC TWNAKCANQT TTCTTCTCTC MMYMTMA CXC WCNNTCNCCK 78 0 

SGACCYTCTC ACTKMKCCKM TCTCCTTMCX CCYMWCNTCC MKYNCCCTCC NMTCMTCKYT 84 0 

CCTCNCNMRY CYYY AX CAK C NMCTCCCCAN KMCAKCTKCT CCCCCAKMKS ACNCXCCCWC 90 0 

CCTCCTATCC WCTCTCWC; : ATLTCaCTCV CNYCMYMKMC ACNCKCYAYT CNACTMNMWN 96 C 

ccancnctct ctnyctcacx acgt/ckcck ztackcijymc nrwctyrcc? cxkccnccrn 1020 

C 1-12 IM CM KCTM CTCTCCWMKM TCCCWCCCAT CTMMKSTCTC WCTCNITCCCT C17XCCYNYNT 
KC YTYCCMYG CTTCKNTCMT MCCWCCYATC TCTMKCCTCT CWCACYMCAC WIOTACWNCC 
ACTCTCTRCW CKCCKCMCCR MTCTC3 



103C 
1140 
1165 



(2) I NFO RMAT I C N FOR 5 Eg 10 NO: 3 29: 

(:) SEQUENCE CHARACTERISTICS: 
A ) LENGTH : 1 2 .1 0 b.^ s p Pairs 
(B) TYPE : nuclei;; ncrid 
v 2 ) STP.ANDEONES S : : ng 1 
(0) TOPOLOGY: linear 

MOLECULE TYPE: 3encir.u UNA 

;x:; SEQUENCE DESCRIPTION : SEC IE NO: 329: 

NGNGGNNNNT CWTACATCW*: TCTNCACC3G NGtfTCWATTG CGCGCCGCAW NCTTGTMNAS -5 C 

agaat _ - cnn aayt 2 ggc ac anatgtcttt 75 tmt aktg t ggcggggngc cacgccktat 12c 

2tgyccctgg gytracccaa 2ccc0cggc3 cgggccracc aggcsgggra tscaggccgc 13 0 

2 gcggccgcg 2 cog™ at a 7 eaagcgccgy ttttktrata acggt3ccgc cgccgggtra 24 0 

ttacgggcaa aaycccpiktt ttcggtrta7 aacgctaatt 3caaccawtt 

aaaacicect tsytattttn tgggetatsi 2-gytgcttcg gcaaacg 

. — /.tccg2 ;g2S2cgcon aaaaaccac: aatyccgytg ggggtgkycc cmcaggcsgt 4 8. 

ijccycgngy 2acctggc2a aayyceeawt axa7t3ggtg z cyckt5cgg cteytgggcy ",4 

:;aaaaggggc atggcsccgt tyytttacct caayccccna ncantwacct ytccscccgc 66; 

ggggncanaa 2gsttngctc igsggnaxc: tkstmcccgn atcnaaaggc cngaatttgg ^2: 

7yy537ycna attwtwkr'.ky ec c2wcntt0 y.aaaaakcca aaa3akccck ycncammyxt 7 b c 

nggggtyssg gcckcictte s:r>rttaaac2 cyccccaaaa yynsgggkkt tccgcynsat 94 : 

. J ^CA. — NCI:'. GNGGG'JOGNA /AAAAAAAA/ TTTYCCC AAA ATCCCACCYY TCY10TXSTRY 
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(2) INFORMATION FOR SEQ ID NO: 33 0: 

H) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1022 base pairs 

(B) TYPE : nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 330: 

NGNGGGKNNA TMAY CWTCTC ACSSGGTCTA TGCGGCGCAW CTOGTMAASA OATC-CNAAv 
:^ G ^^ GCATMTCMMC CATATATAAC CATTGCGTCS GYWTGCAWCT CRAAWCTGTC 
C^.CGKGCCG TTKTACRAAG GTGGMWTGYT CWTY C CTRAA SCr~" v ^n~~ ^^^^^^ 
-TKGGGCTYC AC TTTAAC S G 3ATKSCT3C-J TTKTAYCATT RATG CAAWTA OTTCYCaAV^ 
: . : ^AGo^C SACSGCWYCT TTTYCCGCRA GRACAATNGA TTG3AWYC3C 7YCGCRAGGC 
...^■JUi ACGGGGCNCG AAAGGYCCGC GCAAWTSCCT GGKTCAAAAA TGGTGCAAAC 
AAAMCTATCC C CGG YTTRAC CGCAGYTAKC ACAAKAAAAT TCCCVTGGC" GCAC-AWNN- 
T.rCRATCWY CWYCCCCACC TTRAACTTGK YTGCSGTATT GCCTKCCTCC CTCRACAGC-' 
iCNCCCKTCA AACGTGCGGT GACTCCAACT GGTCTGGYCG AASGGGGGYT CAMCGGACAA 
.-^C.CGRANN TCGGCAAATT TTCNCCGGCC CYCGGGAAAN GK7GATMTTC TCSNAACCSA 

CHGG^amrrw naaccctgaa csssgshkga mynsccsgga anttttcgct TYNGGGCGRN 

AAANCrr... AAGGTACCCG KGGNGGGGKG GCCYYTTGGG AAAACAACCG CKATTGGKTT 

,!^^ A l" 4 ' rNTT TKCNCCCCCA TTCNSGGGGG GGGCCCCAMC CCMMCTTTTN TCMSCNMTYY 

YCYYGGGAAT TNYTCCCC5G GAAYYGGGSM CCKGYCCTAA NCCC 3MNWGG GKYSTGSNAK 

GGRATMAWWT TYS TTTYYMC CCGGCNKCCC CCCKAKMCNT KGNTGAACMA AAAKCSGGGG 

Y ^ n< '^.r "- Gc - 3 -^- ™aaamman ggggkywtyy ckccngscnw 

rv,GG3T T..GG;. 3 GGGS3A7YKG MACCCCKTMT AYG3GGGGG7 NTKTKYCCCG 



6 0 
120 

13 0 
24 0 
3 00 
360 
420 
480 
540 

SA 600 
660 
720 
7B0 
34 0 
900 
96 0 
L 0 2 0 



:> information for ilq :o nc:33: 

LENGTH: 12 di oase rjair:, 
TYPE: :.uclai: acia 
3TRANDEFNESF : dina.o 
~*FnL.JGY linear 



------- j - AGRGAGG 

- j IV-vAt*^:' ffaaaaa 

•« — _ .- v^j v : FL A 2 GO F F A, 



7AKAYF>T-T NFAFF.GGGA TCWATTG CGG 0 2GCAATCTT STMAAGAGAT 
-Art._ Wii :rj K^TGRASCGO AFFAG2GRGA GCTFGCSGKT 

^-u.wGck: fggtggcaat scgaacogcg cccgaaaacn isc 



2 F F N A TA 2 G Y I 
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SAC7CC7TTN . . CllGGGX^. A7ANA7YYTT YCXTNGGG3K TTCCGMTCNC AMMAA T K r rn C 
RGGGKAAMCC AGKNTNNTCC YYYYC7CCAA NNTYCCYKGG RMCYNNYYCY TTAAANRASR 
SAACCCKSGG GKCYNCTCSS TARCCCCCAM KAAAATTTCC CCCSSKTTTC TY^NNKXMRW 
GCCCCCGAAM A CTMTW A YTT TCCCXCGNNN TTTSYCCKCS KCAMWMWMTG KKNCTT— 
YC3CMATAMA CTTNGGKCCT NTCNYGSGCG CMAAANAAGG CGCGSTTCTN TTCWMAMACA 
YNTSGNMMMA SAAKAKWATA AWNNTRKKYX TKNNCCCNCC CKCKCTTSNN TNKCCMCSKS 
GGGKNWNKKR GWCTCCWCNC CKCCCNCKNK CCKWATMCCC CCCCSKCCGM NOWTT"K- 

ccc 

(2) INFORMATION FOR SEQ ID NO : 3 3 2 : 

-i) SEQUENCE 'CHARACTERISTICS : 
£Ai LENGTH: 1069 base pairs 
(3) TYPE; nucleic acid 
v'Cl STRANDEDNESS : single 

— . - — - , ^ w ^ ^ . _ ^..icax 

,xi, SEQUENCE DESCRIPTION : CEO IE :;0:332: 

GGGGNNKYA7 MCAYCWTCTS YACSGGGMNC TATTGCG3CC GCAWYTNGTM GAS AGATCTC 
■GAAYTCG^CA MGAAAAAAGW GA7G7GCTGG ACCTTMCCGC GCGGGACGCR ACCRACAAAG 
RAA3CGCGCC ANAA7A. . 33 C7ACAX7TGG TCACATATTT ACCCAATTMT AYCAGGGAVT 



_ ^CCRACIG CACAATCCCR ATSKTGGTTT GCRAACCCTR ACCGT3CCC~ 

MY , / 73 7 7?. A S . . GAAC7AG G G C RAAAAA^V 7 3 G C CRAA WY CTCGCCCTGA NTCCCGCTCS 

g_gcnaataa CTAGGCCCAT 7KAACGGAAC CGGNGGCCSC NANTTGGCCA ACAGGTCCTR 

^r^ A ?^ r GC CCrASYYCGG 77GGW7CCCW TTYCACNCCC TNKTCTCKTG CCGAATYCGG 

WTCCRATNYC CCWTGGGCCT 7K77KY7^y: KYCGGTNCCA AWTCTNGGTA TNC7ATRGECG 
7 737C7AAA7 3 CAN AT' 



. - - — _^u-^rt^_w, tA A7TN - .T 

- 7 AACC3 2AACC 7777 7AAJ' 



720 
780 
840 
900 
960 
1020 
1080 
1083 



120 

iao 

240 
300 
360 
420 
480 
540 



_ -VC7GG3NTT7 NAT7T AMMAN S?J>CGG77C7 

^^if^'"'" ~~~^ n ™— - AAAAAT3ATN ATAATAATGX YGSCTT7CAA 500 

G2CC7 CCGA..CRW7 CSG™?CAN7 TCCCNGNGGT 7AAGKTGGGA ATTTYTNAMC o6C 

G7777 :IA..7SGG:;A AAAA GGG7CTCAAA 2MNYTTTTTT GSKS3NTC3G ■ ^22 

ggy?7Y.t:;aa ACMCGGYCRC RCCGGAAATT ^0 

:;ttttytyyt trccsscsmn tngssgggnt jhc 

-I ^^Z^l 3GGGGWYCYN '^CCRMNTTT 7777TTTTTT C C G 7NNMAAM : >0C 

^ *\ASMCCCCEC 37773CNSAA ACCCTTNAR 37T7TY7MMA AANNWYNNGN 96 0 

77 MMNAAAAAAY V77 7777NRM A~7M^:;7-";a Y^-~7~GG3IJ N 

7^ ::?majy.t: ^-am^Itp ;AMr:sMT— y.:;?,:,xw;Y 
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YCCCGGCAAA 

CCGGKTNAAG 

MCNCAGGGGT 

AAAAGGGCTG 

CTCTCKTGCC 

ATCTGGGTAT 

T C CAAATTT A 

AAATGAATAA 

NMNCCCCCAG 

KAAAC CCTYC 

SCCAAAATCC 

TTNTGKTTSA 

GGGNATXGYG 

NYWRTT TCl ' N 

AAAAWCCCCN 

AATMYCCNCC 

3AGCKCCSCG 

ZZIZSZZZCKS 



GGGCCCACAA 

GGACCTGCCT 

GTCAAAAAAC 

ATGCCCCCAA 

RAATCCGRWT 

CCTATASTGT 

CCANCAACGG 

TAATAANNGG 

NGGKTAGGTK 

YCNGGGTCWW 

MAAATANTTT 

ACCCCAAAAC 

3 CYTNTCTTA 

kccccktnmr 
nnnaraktnt 
rasanmcasm 
mnnanmknck 



CCTGCAGRGA 
GGGZKZGZSC 
CGCAAACAGG 
GGGGGCCCGC 
CCRATNYCNC 
CCCCTAAWTT 
TTTCTTNCAT 
KCNNTTYCNA 
GGG AAANYYC 
TYMAAAAAMA 
YYTGGTYCNA 
YTTTTCMNAA 
TKTKYTYMTW 
NNRAANNGGN 
TTMKANNRMN 
MMGGP.GMRS 
CTTTTTKCNC 



SCACYCRATG 
TMCAAANATC 
CACSCCANCC 
NCGCAACCTT 
CWTGGCCTTK 
CCAAATCTGG 
NCCAAAAACC 
AGCNCCCCCC 
TCMACCYYCA 
NTTATTTGGN 
TWAAAAAMCG 
NCSSKTTTTY 
CMGGGGGGNN 
YTCSYNANAA 
SCXCNKNGKY 

■JNGNNGNGNN 



GKTGYTGKTS 
WYCCGCGGGT 
NTTTACGGGS 
CCGTTGGTCA 
TGKYCTYCTY 
GCTGTCCATT 
GNTKGGCKCC 
CCCNATTCCA 
ANC C CTW AR S 
NGNTTTCGGG 
YGNCCMNCCC 
CYTTCCCCCC 
MKMTCMMCCC 

aagcnccccc 
ycccccccwc 

GGNGMNCKCG 



JNCGGGCGGG 

'/CGCTGGRAT 

CTTAAAANGA 

ACAACCCGGT 

CGGTACCCAA 

TSCTTGGCNT 

NRACCCRAAA 

TYSNGTTCCA 

TTTTNGRAAT 

MWNCXRKNST 

GGAAAAWTTT 

AMNWTGGGYS 

CCMTTTYYCY 

SCCKNCCCNA 

YNMNNAAAAA 

TTTTTTCSRA 

ECNAGAAMWK 



240 

300 

360 

420 

480 

540 

600 

o6C 

720 

780 

840 

900 

960 

1080 
1140 



(2) INFORMATION FOR SEQ ID MO: 33-1: 

. ? SEQUENCE CHARACTERISTICS: 
(A) LENGTH : II 05 base oairs 
;3; TYRE: nucleic acicf 
' C : STRANOECNES S ; s i na I e 
(Ci TOPOLOGY: linear 



MOLECULE TYPE: Oer.nrr: ~ DNA 
:xi, SEQUENCE DESCRIPTION: SEC IT "C : 3 3 4 : 
_ --i^^^u^.-i. ,v~. . . ?v /, . _GCAACTNGT MAASAGATCT 

_ -*aa : TCoGc aakanacacc accgcestot xtatacaccg :aaat^ttct gtktgccaaa 

"J;^^^ ^gytccaac ';c::tt aoy~r — ca:cagy tcagtgttra 

. _ ~ ~ ^ ' - ^v^r^jr j J ^- k -w.;\.T„irr.^. .-wGCTTTAKC '"AAGRAWYT3 GKT5GCCEGC 

'?™flf C ™ CGGTRATCCC AXC^WTCCCC CGGCCMRACE CACCSGGCAC TTT SRACGGT 
i .. _ -jCCAATT OAAAYCKYCT SRVrTCCTTC v 

w .'. .t joRACT CTAAGGCCEA 



•msg.v 



7>AAjMCCA ATTMCWGG r'.T MCNWf 



AAA AACASCrrNCC . r>TTAKCf~*vGG "E~*'*CSGGCA 

^ ~ " ^ " -j „ — . ^ „ . C?*l'..j .7 C E CAAN A " "E~" "aGGAAT," 

2^^'^;;;; 2TSCCCAAAY CCSGKCMAAA N7;GRKTTGGK TTNGGCNACC 

. - _ ^^^^rt nntttttana 3::;;t:cce"t ccccamaaaa ana:'cocny~ 

7TXKW ^'^ >:ctsccccng sn^ggggkc:-: sgkttattmt nnn~ S ccc~ 

wjt.-.j.->AAA AAATAiC-«T _ J YCCECCCNC ETCCKNCKNR JKAMSKSCGC TCE^Y-TCN' 



^o^tGS ^ z. 



36C 



" H 0 
84 J 
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(3) TYPE: nucleic acid 
vC) STRANDE0NES5 ; single 
(D) TOPOLOGY : linear 

MOL2CULE TYPE; Genomic DNA 

(xi) SEQUENCE DESCRIPTION ; 5EQ ID NO: 335: 

JSJf^ TMYCWYC TSCACSNGGA ACWANTGCGG CCRMAWCTNS TMKASAGATC 60 

CAAGAGCGGC AAGAGTGTGT GCATCTGGTC ANAGTST^MA CRCGGTGCCG 120 

C3GGTGKGTR GASCACMCAT NTGCGRACAC CAAACCCKTC GCGGGYCACC GGCKTCGCCT 180 
CZAGGC ^ TCHAACAAYW YCTYCTGCAA CGCAKGCCGT ScGcScCG 

RA.CCTGGKT CASYYCGCCK TGCGGTCCCC AAGKTACTGG CSCAYCAAAA CCGCTC~GGG 3^0 

W^WCS ATYC - RWT - RATAYCGCCA TGGCCTNKTC 420 

™ .xACCCAAAT CTTGGGTATC CTATANTKYC CCWAAANRCA AWTTO^rv 

:;:r::^:^ ^° atttammaca ncggtttctt tcwtaccaaa aaccs:,tggg 

^ZW.RA rtAAAKGATAA TAATAAKGTG CWWWCAAAAC CCCGCCCCCC ^T~- AAY 



";_„^ T ";* *'" CNj/iCAri ^^SGGNKN KGS S CNTTYT TWTTTYYNNA GGGGGGRRWr 

mhN yyc-jaankg ntcc-gsnma AAAGAGAMTT YCMKAAAAAC cccoicnc— 

J/wiiACCCC MAAAKWTTCM AAAS M S ONNG YCCCCC 

(21 INFORMATION FOP SEC ID MO : 3 3 6 : 

i ) SEQUENCE 2MARACTEP I5TT OS 
■A; LENGTH: 1042 case cairs 
•3) TYPE: nucleic acid' 
0 ! 0 TRANDEDNEE 3 . j; nq I ^ 



2 0 

54 0 
500 



'^^f AGGTNGGAAT TYTMAACCCC CAGGCCATAA SNTTNSGNAA 660 
«^ GGGMTTCSGS CCATKGYKCC AAAACEAAAA 7^0 
' '° Gi - R ^AAAAACC GGCCCNCCCG NAAATTTTTT GKCAACCCCA AACCTT7KAK 



700 
B40 
900 
936 



d; tofolcg 
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(2; INFORMATION FOR 3EQ ID NO : 3 3 7 ; 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1073 base pairs 
(3) TYPE: nucleic acid" 
fC) STRANDEDNESS : single 
(D) TOPOLOGY: linpar 

(n) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 3 7 : 

™^ ^YACCSNr, GMTCWATTGC GGCC2MAWTC TNGTMAASAG 

-U^SU "c~^SS ACGCMAYGTC AAGTGTRAYY CGGT2ACATA TCMTCGCGNG 

____ _ "«^^v.^l wATCGGT RAT G CAACYT 

^___C ^x-^ XTCWT7RAGG -7. G C T AAAGG T CAMCAATTCC TRAGG TYM CN 



GC RArt ^ x'CTRAC "^~G7AAT7A T " T " , - - , „ „ 



NTYTTTTCC 



I TNFOrMATICN rC^ ID 

. 3EQUFNC7 -Tr-iARACTEkiSTl 'JS : 
A; LENGTH: 106: Das-- pairs 
3- TYPE: nuclei 7 acid 
7 : GTRANDEDNESS ■ 3 i:— l 



12 0 
130 



360 
420 
43C 



.lr,""^ iGG ' J wAArrAAGK ' n; rrGGGCAAC3 GCGGRWTCYC NCTGGCCGRA ATTCCCNCA' r 

7CCX_ AACG GKTGRACCGT TTYCCCGGYT GCCCTAAYTG YTYCNTGGGC GCCYTCGGC~ 

™«*ggcaa t^ua^c c~^Z; 

I™:™ ^:;:l^If CTG TNTCNTGRAA AANKCCACCN AACCCGGNTT 540 

^TT^C CC^t4 CGG^g'5 ™™ !£-^^ CMTYCAAAAG 60C 

n _ - ± uv-~rt o^^S Gv^_ ^vjTTTM CGMWCCT^RR 

G C GG G G AAAN AMTTSGGN~T 3GSCCGGTCC CCCGNAATAT YCNTGGNCCT 

^^--^ GSGNAYCCGG CCWTKGGGGK TNCCCAGTTG GWACAA— 

ArvL^^TTCIA AACCCGGGNC CGGGG 



780 



GGCCCNTTTT C CTMYNNAAA AAGKGTTTGN 84 0 

NCVV ^f^ff GGN CCNAACY YYYCAANTTC CANACCTTTA 900 

^ - TT ' ;CCSGS SAN—CCCCM MMS5KNCGGG AAAAAAAGMK 9fa0 

-~ ;;;"^- KTTT TWCTY "-"" kmhsnsnmcy kcskectikry .ignsnknc- 




WO <*9/420"76 
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T CTJJTY C GG T GGGGCSGGCR AN>TYTT3TCT YCCCNAASAN CTT AMY C CAN TTCG 
CGGKCAAAWS NGGGGGGGNA AAGGGCCCCC CGGNTSCKCC GGGGKXGC3G CYGG KTT CAA 

AANTTTCSGG 3KTSTMSCGG ^TCSCCGCC CSGCCAAGRA CCGNGGTTTT TTTTTGAACO 34 0 

KCMANTC3S A AMCCGCCSSC GGCMAAAGGS GC3TNAAWGR RAYTTNKSCC CNN AAA 3 3 GG 900 

CCCCCAKYTY SGGKTTCNNC GNC33GXFCGT CGMTSTTTMM MRCCCTTTGN GNKTTTTTAN 96 0 

MGSCCTTNNC CACCCCCYCK GGGKCSMNNA GAAJC7MYWKC CNGGGGNNAN RSCCCCCCNN 1020 

GSGKGGGGKG MGAGY5CCXT CTKGCGNCNN YKNTTTCCCC C i 0 61 

[2) INFORMATION rOR SEQ ID NO : 3 3 9 : 

ii) SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 986 base pairs 
!B) TYPE: nucleic acid 
IZ) GTRANDEDNE33 . single 
TOPOLOGY, linear 

ii, MOLECULE TYPE: Genomic UNA 

gnngnnnkwn atm c a y C WYY GTS3A33S3G GMTCWATTGG G3CCGCAWTY tngtmaasag 

A3GTMGAAYT CGG 2ACANAG CGGCACAGAG TGTGTGCATC T3TGTCANAG CTGTCAACGC 

GGTGCC3CSG GTGGTRASCA CMCATTGCGR AACACCAAAC CCGTCCGCGG GYCAC33G3K 180 

T 3GGGT3CAA AAYC 3TCCAG 3C3A3GYCRA AACAAYVYCT CCTGCAACSC ARSC33TTY 

3 3GGC33RAT GCTGGKYCA5 YT 3GG3KTG3 GGTGCCCCAA GGTACTGGCS CWYCRANAC 

3 3TYCGGGRA ACCNAACGTA AATCTT3CGN AATTTGCNTT 3333CTSC3G TTRATNAATT 36 

TGTTAAA3GA 3GCAAACCTY 3GGGCKTCTC 3TGKTGCCRA WTCCGRWTCC RATNY3GCCA 420 

TGG 33TNXTC KYCTYCXYC5 GTMCCCAAAT CTTGGTATGC TATATTGTCC CT AAAT G C AA 4 80 

3 3_jTT3G33 3 3 AAA GUN AAA AATGATNATA ATAATGGTGC TNT 3 AAACCC CGCNCG3AT\ 

:natc3g:-:cc ammccccrgn 3gktant;kgg 3:;aattctt<m aagzcgaagg gataag:itt: 
jganaaag3y ncncmggyca 3~aaaacany ntt^rrtggny 3 3nttcggmn ycatgg 3 tni 

GMAAAAG33A AAT A 3 TNYY 3 3GY3 3AATAA AAMMMSGGY 3 3AMC3GGAAA CTTTTYTTG: 
. . iG I'Aj-lA 3 3* .'A AAXC '^TTTTT 3NAA3 333 A> J -iTrT" rr ~ ^ ~'" "?<^*? """v;v*j~ 



6C 
120 



240 

300 



30L 



CNSGGART'-OT 3 4 

J33v.T_..*G3A AT GrlYGCMAA AGNGGGRANA ~3ARCCCGAA TTCTTTJNNTN rGCrCTCO r:;.^ >c 
TRNAAAAGGG 3KN T TY3'CMAA, AA3 3*«CCN"0 \'3.'T""' 
NAANA3IC33N ZZZZZZ Z G*-!MMN 



_ ^ .J -AAAGA?'N . 3*i 96 C 

99n 
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s sssss sss s-ss 

SS2 ^ ssi 

SSSS SESS =^ £™ ~£ 

s= =s ™ ™ =s 

CSSOSmOAT TAYGGMSNMT ESSSS Sr™^ AATTraKTC CCYY«; 
TNAMCKCCTCN CCTOiM SSST S ™ANNK 

CGGAAMMCNN AATKGNNNS C C^m™ ^ MKCCNANMAA AYASGTJTTNK 

CCCNSNSGMN RRGAARMTNY SS^IT ™GMaA CT 

4 — ^SK.-t uKGNKAAAAW GKYCCCCCCM AAAG 

(2) I^ORMATION FOR SEQ ID Jn : 34i : 



i) SEQUENCE CHARACTERISTIC*^ ■ 
A: LENGTH: 113b base cairs 
:3! TYPE: nuclei acid 
(C) 3TRANDEDNESS : single 
(Dl TOPOLOGY: linear 



(::) MOLECULE TYPE 



3enarr, 



(xi= SECCENCE oescript;-;;: 



:xz:rr mt a cat on 



AAA WT G G C C A GC~G~~'— — RAACGCCGGG 
— ^^.^ .w.^GGNT- TACGCAACGC CNGCCCCCAA CHCT 



360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1074 



„„„„ " " * vjwv^ . j;jTCWA^_GC GGCCGCAWK"' -"rr^^-- 
^C^GAAYT ACWCTGG^A — ^ — ^ *0 

attgngcgcy tca~-gc~ V — —c_.gg._g tgtgtacccc 12 0 

^CICACGG CKCTSCMTCT RAAGGCWTGG ~GCAC'G -^^X^ --C^ 19c 

:4o 

.i o n 



5 4 v. 

5 C ^ 
•5 c," 



AATCCGGYCC TrC~CANC^ — CNGCCCCCAA CRCTTTCTT, 

TCGAACTTPK T C7N AAAT " ^^^-GA ACTGGTTGC3 CNCGAYCTGC 3 so 

TTCNGGWANG TCGGC^CCG G^^^:: ™S GAAGC NGAACCTACT 

ygggaaaaag ^rcctcaatg ytygg~c^~ '!^^ GAACT 3GYKNN 5Ttg 

GAAGGCGMAG JG T T AANG C G v^^Z'^ ^c-™^" ^ GC:i CC::TGK -YCCCNAATG 

* JNKMAMWTT Y — '«GIJSG~* W ' " " ^ KWTTVCGG MGGANKAMNN 

^ yrTW ,_„ ""** VJ ^-' '-^ _"S . . AYAGA AI^ACTYCC^W WC^"^^— - 

jrt^u^Co GCGMG — ' t---^.^. m ^, t 'i^v,^,.., ^ 

7 G AA tfKWR ?7 Y JCYAGGGGGY ::r;r-VJ— -^V- ™° f " -C^GGGTi 

T.\TKSAG*';GG ^YCCGMAGMK — A -"^.T^.'V". . "1 -^K^C 7 C?rn:GW AAA 

-T.,.,v ,AG TAilTCGG GGCGSr^GA: WC3GYC'- f G;;G v;;->T'— -^'^^^1^ 

>^;GKT?-i>^rr msccjcgn ^^^^^^ ^ 

-ccttc^ ^m«^ ;,vma?-C,; : ^-'.-IT' ^ :G3NKSSKi; ^ 
3ktc:;gsc"- — ,^g.7-kvg:-- -.-~v~:.Y-'.'; .:"Y""'";* "" r ~ : ' : -5"-2Ns csangraga;, 

k-^j AGGGC 



:oq; 
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' ::) MCLECtTLE TYPE: Genomic DNA 
(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 342: 
CATCTGATC GTTGGCAACC AGCATPGrif: TP.r-'v^^ 



;^CrI^" AACGCCAATC AGCAACSACT GTTTGCCCGC CAGTTGTTG 



3;;tatcatg 



AirtLv-^CGA -AAGGTTTTGC GCCAT 
3GACTCCTG CATTAGGAAG CAGCC 



"CCGAAACAA GCGCTCATGA GCCCGAAGTG GCGAGCC 



AGCATTTGCA 


60 


ATCGGCTGAA 


120 


GAGACAGAAC 


180 


TGCTCCACGC 


24C 


TGGTCAGAGA 


300 


GCATCCTGGT 


360 


TTGTGCACCG 


420 


CTGGCACCCA 


480 


AGGGCCAGAC 


540 


GCCACGCGGT 


sue 


TTCGCAGAAA 




GCATA r""r*"G 


~7 n 


* *■ a 'v- jGGC 


780 


*"*■*.- — ■ — _ G C 


3 4 0 


GCCGTTGAGC 


9 on 


CC2GGCCACG 


96 0 



G G G C CT G C C A C CAT AC CCA 

;:;I:::: A !' ^r! G :I^: G :: GATATAG g -~^gc^ ccgcIcctgt ggccccS 

";^;. T "^2 ;"2I^"™ C GGC37AGAGG A — -agatct cgatcccgcg aaattaatac 

" G ^~ A ^ G tgag ^ta acaattcccc tctagaaata attt-g— a 

GGAGATATAC ATATGGGCCA TCATCATC— CAT-AC—* ~™ 

~GG~ A' G~ ^-~* r ^^ — * ^^HLui^n . -^AC^TCAT 

~ A ™i :^?!:";::;, GGC3 --- ga - gcggtccagc gggcgcggga 

I—IIT. L^L * * CGGGT 0ATT3AGCAG GACATGGCCG TGGACAGC3C 

CAAGCT2GA AGTGTCGTTC AAGATGAGGC CGGCGCAACC 



.^rv_T ^ G 



- j AAA 

3A3TA0CCCC GCGTCGTCGC CGGTGAC 

^ — ~. - - - . . .jo L — GGGAT 

JGCCTCCGAC 3CCTATC7 

- .-u-a . . T AA J 3 G AAAAG7 

• ~- — ~ „.^LirtT? GCCGCGCT 

Jw.v^.-.^', _ -jACGGGTC 



3AAGC 



.oMn.nu333C GCCGGCGCCG GTACTGTCG 

3GCGGAGACC GGTAGCACGC TGCTCTACC 

033AGAGGTAT CCGAACGTCA "GATCACTG 

^ GGGACG3 CCAACATCG 

-rt* ^ -j ^. o ^-aC.AAGGGGC TGATGAACA 



' - --^> - ^-t. - j - .■■l^ - agggc 
~ — ^ j ^ j t 3AACCTCCCC 



-vTUuvj^nHlj . CGCCCC 

- - GrtG AAC 3 3 "A A 7 C 

- j. AT " GG C A. ~" " .3 ~ 
-jo^/vj-^IATjCT 3TG!3G" 



1030 
1140 
1200 

12 60 
1320 

13 B0 

14 4 0 
150 3 
156 3 
1 1> 3 T 
3 6 o 3 

I 3i 3 

:b d .. 
: y a ::■ 



- - - - — - - CG ACCAG 
a- j^jTTGATCG 3GA 3 3 
^ .Jrt^u^AG'aTA ATTT 1 "* 



vjy \ ' — T G AAAA 



3CCAGATCGA 3Sfl,' 
— jjriL j3'.' 3 S3 
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AACCGTCTCG CTYGACGCCA ACGGGGTGTC TGGAAGCGCG TCGTATTACG AAGTCAAGTT 

CAGCGATCCG AGTAAGCCGA ACGGCCAGAT CTGGACGGGC GTAATCGGCT CGCCCGCGGC JiOU 

jAACoCACGG GACGCCGGGC CCCCTCAGCG C7GGTTTGTG GTATGGCTCG GGACCGCCAA 3-40 

CAACCCGGTG GACAAGGGCG CGGCCAAGGC GCTGGCCGAA TCGATCCGGC CTTTGGTCGC 3300 

^™?f CG GCGCC3GCC3 GGGAAGTCGC TCCTACCCCG ACGACACCGA CACCGCAGCG 3360 

^.TTACCG GCCTGAGAAT TCTGCAGATA TCCATCACAC TGGCGGCCGC TCGAGCACCA 3 4 o 0 

CCACCACCAC CACTGAGATC CGGCTGCTAA CAAAGCCCGA AAGGAAGCTG AGTTGGCTGC 3480 

TGCCACCGCT GAGCAATAAC TAGCATAACC CCTTGGGGCC TCTAAACGGG TCTTGAGGGG ^540 

TTTnTGCl'G AAAGGAGGAA CTATATCCGG AT ' 2512 

(2) INFORMATION FOR SEQ ID NO: 343: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 ammo acids 

(B) TYPE: ammo acid 

(CJ STRANDEDNESS : sinale 
:0) TOPOLOGY : linear 

{ n ' MCLEC" " TfT3" - 'jeD 1 " ' de 

(xi ; SEQUENCE DESCRIPTION : SEQ in - 0 : 3 4 3 . 

7al 31n ?he Gin 3er Gly Asp A,:: 5er Pro Ala 7a 1 Tyr Xaa Xaa 

Asp Jiy Xaa Arg 
20 

(2) INFORMATION FOR 3EQ 10 NO; 344: 

' : : SEQUENCE CHARACTER I3TI ZZ ■ 
■ 1 LENG TH : I J :im 1 no acids 
R' ~Y ?E : ammo acid 
~' 3T?ANT;EENESS : .3ir.a^ 

MOLECULE TYPE: peptide 

3ECUENCE EES CP. I PTIEN : SEQ IE NO: 344 
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(2) INFORMATION FOR SEQ ID NC:346: 

(i) SEQUENCE CHARACTERISTICS: 
CA) LENGTH: 12 amino acids 
{B) TYPE: ammo acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: peptide 

(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 346: 

Aisp Ala Gly Lys Xaa Ala Gly Xaa Asp Val Xaa Arg 
15 1C 

INFORMATIGN FOR GEQ ID NO: 347: 

■l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

in) MOLECULE TYPE: peptide 

>'xi) SEQUENCE DESCRIPTION- SEQ ID NO: 347: 
inr Xaa GIu Glu Xaa Gin Glu S,: ?ne Asn Ser Ala Ala Pro Gly Asn 



• INFO RMAT 11*:,* FOR GEQ IG NC:i48: 

SEQUENCE CHARACTERISTICS 
'A; LENGTH: I" 33.3 e pair:. 
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MOLECULE TYPE: Ccher 
iXl) SEQUENCE DESCRIPTION: SEQ 3D NO -34 9: 
GCAGTGACGA ATTCACTTCG ACTCC 

(2) INFORMATION FOR SEQ ID NO: 350: 

;i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2412 base pairs 

(B) TYPE: nucleic acid 
(CJ STRAND ED NESS : single 
[D] TOPOLOGY: linear 

< 11- MOLECULE TYPE : cDNA 





SEQUENCE 0 


ES2RIPTI2N : 


- . ' o : 


1 1 V 

J 3 u . 






CATATGGGC 3 


ATGATCATCA 


TCATCACGTG 


•■w — xj/A , w.ATCiA 


T CGGG AC CAG 


CCCCACATC2 


50 


TGGGAA 3 AGG 


CwGCG-- iCGG A 


G 2 CGGTCCAG 


C-j G 3 G3CGGG 


ATA3 2GTCGA 


TGACATCCG2 


120 


GTwG'^T _ G^j j 


TCATTGAGCA 


G2A2ATSGC2 


GTGGA2AGCG 


CCGSCAAGAT 


CACCTAGCG2 


180 


AT _AA jCTC 3 


A-r\GTGTCSTT 


— -^A-jAT j AG j 


■ — u*j \ — s » — AAG 


_ o^vj- j-Gvj \_ T C 


GAAAC 2ACCS 


240 


AG C 0 3 TT C G 3 


CTGAAACGGG 


- j ■ — GGCG 2 3 


u-jTACTGTCG 


CGACTACCCC 


■-*jv_wT^G - CG 


300 


— J^-: - ja _ .J .. 


TG G C GGAGA 2 


■~ j jTAGCA j 3 




CGCTGTTCAA 


- w * G * jGGGT 


360 


_ ^'_i ( _3 TTT G 


A2 GAGA GST A 


T2 2GAA 2GT 2 


m. _ ja 2 2 AG CG 


2TC A 3GG CAC 


CGGTT 3TGGT 


42 0 


rj'jn ^ ■— J 


. — 3 i-AG-j C 2 G 2 


_ J -^JT'.Jrt . 7 


3TCAA2ATTG 


; ~r j - 2T3GGA 


2 3 2CTATCTG 


480 


*. ^ j'jry-Vj-lG J „ j 


ATATGGCCG 2 


3CAGAAGGG 3 


-TGATGAACA 


T 2GC 3 STAG 2 


CATCT 3 2GCT 


540 


Cn ; j _AGGT3A 


A2TACAA2CT 


1 j GvjAGTG 


'Vj^ .j^jjCACG 


j- — Aj>a : _j ^.TvjAA 


- GGAAAAGTC 


600 


_ * ' J J IU "A 


_ ' j a ACGAGGG 


w m „ - - 

— .V. ^ a -.-LtVj-a. 


-n _ i >jijomCG 






6 6 0 


•-AjM. O' J". I 


* jAAC2TSC2 


- S 3 2 AC 2 2 2G 




- _ j „ T^" 


2 3ACGGGTC 3 




J i. JA _ .1 _ ^_ . 


- -TTSTT2A2 


— ~v j i. A w CCG 


. .. ..rtAGGAAi.i 




. TGGGG 3 AAG 


'7 8 0 




- C-* C 2 A 2 














1: — irt 1 _ 






t'.r t i u.'vjAAG 


8 4 0 


J'-J 1_ V "jT J 


■~i — -.-vTG jT jAC 


— ■ jTTS 2 3 22 


GAGACA 3 3 3G 






o 




- -jACCAGGC 


—.-Vj - V.AA 2 33 


-j' jA 3T2 3G 23 


.. _AA'._ . 




9b 3 


- — - 'J J — \j- 1 ^ - 




Jrt _ j-, ; _.-i_A 


AGG ATT 3 AGG 


w j : ~ jijC 33 2 




: c 2 




— iuC SAAC 2 A 


-tj . — ;nTTT 22 


.■a ^ Gj^. _ 3 A 3 3 




1 >v.jACu jCTAC 


1080 


-Con. _ATCA 


A 2TAC GAGTA 


- j_CATCGTC 












.-vn : — _ 3 3 3 C 


.-win j jmC 3 G 




;i4C 








• \ - - M \ y ^ J J~\ _ J 


j ^- . V\ _ . Aj"V' J J _ 






























. . .3 „ ^ 3 3 AT *" 










i AT 2- 2 3 3 3TA 


_ . .J .... 1 


.j'j/^vjvj 33i.GG''" 




i ~ _. O J-A 1 J „ 







- .... ^.u,: , . 3 


. \- _ .--V i a. G iA 


i * . - ^ ■ jG c.a 






z-vG vj 3 C 3A GT 3 




j — ro 3 AC 3G 


3 3 3 C 2 CAG'GC 


G'_ 3GTGGT3 


; 4 4 " 


- J _ - - . '..".-.J 




- A, \3C3\GAA3 




. \ v. .jn ja ' „ 


1 ortATATT 




1 ~ _ rt'jo _ _ o 


-j _ o * uvTA 


. ; _'^jAGGGG2 


SAC 3 AGG AG C 


A^j „ A i .j-_ AG 1 .; _ 


^ ^jTGCTCC 








i \AC3 3 22 GG3 


j '3 ?T 




AGC3CCACCC 


: 6 2 3 


_t — .-v — jr. 


ctgttgc 


■ — _ . — AC GA 


- — j'o — GC'33 






163C 










_ ._ .^1^1. 


o AATSCC CAG 








.j ^. _ ^.v\. 3 33A 


.~\ w J w ."A w ^ _ C 


-j ^. \ w 3T G T 3" 


1^4 3 


































o j/i, , — j , T3 A Z 






«. - 1 --a\j ZZ .\ 3T . 
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ATCGGCTCGC CCGCGGGGAA CGCACCGGAC GCCGGGCCCC. CTCAGCGCTG GTTTGTGGTA 222 C 

TGGCTCGGGA CC3CCAACAA GCCGGTGGAC AAGGGCGCGG CCAAGGCGCT GGCCGAATCG 22 8 0 

ATCCGGCCTT TGGTCGCCCO GCCGCCGGCG CCGGCACCGG CTCCTGCAGA GCCCGCTCCG 23 4 0 

GCGCCGGCGC CGGCCGGGGA AGTCGCTCCT ACCCCGACGA CACCGACACC GCAGCGGACC 24 0 0 

TTACGGGCCT GA 2412 

(2) INFORMATION FOR SEQ ID NO: 3 51: 

(i'J SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 302 ammo acids 
(BJ TYPE: ammo acid 

(C) 3TRANDEDNESS: single 

(D) TOPOLOGY; linear 

;::) MOLECULE TYPE : protein 

;xu SEQUENCE DESCRIPTION: SEQ ID NO: 251: 

Met Gly Hi 3 Hi: His Hi;; Hi g Hij Yal lie Asp lie lie Gly Thr Ser 

I 5 io 15 

Pro Thr Ser Trp Glu Gin Ala Ala Ala Glu Ala Val Gin Arg Ala Arg 

20 25 30 

Asp Ser Val Asp Asp lie .Arg Val Ala Arg Val lie Glu Gin Asp Met 

3 5 4 'J 4 5 

Ala Val Asp Ser Ala Gly Lys lie Thr Tyr Arg lie Lys Leu Glu Vai 

b 0 b 5 6 0 

Ser ; j :ie .yc Met .Arg Pro Ala Gin ^r: Arg Gly Ser Lys Pro Prcj Ser 

65 ^0 75 80 

Gly Ser Pro Glu Thr Gly Ala Gly Ala Gly Thr Val Ala Thr Thr Pre 
3 5 9 0 9 5 

" Val ~hr lei; Al.i Glu Thr G 1 v 3ei Thr Leu leu Tvr 



130 125 140 

via Ala Ala Gly Thr Val. Asn lie Gly A^a Ser Asp Ala Tyr Leu Ser 
4 5 15 1 15 5 16 0 



i a Leu A3 n . J rt ^^v uL /vrjn 



.hr ,^a . a_ ':,;u ?:: Leu Arg Ger Aar G^y Ger Gly 

111 2 ■ 5 2 4 0 
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290 295 300 

Gin Ala Ser Gin Arg 31 y Leu Gly Glu Ala Gin Leu Glv Asn Ser Ser 
305 310 315 320 

Asn Phe Leu Leu Pro Asp Ala Gin Ser He Gin Ala Ala Ala Ala 
325 330 335 

Giy Phe Ala Ser Lys Thr Pro Ala Asn Gin Ala He Ser Met He Asp 

340 345 35 0 

Gly Pro Ala Pro Asp Gly Tyr Pro lie He Asn Tyr Glu Tyr Ala lie 

355 360 365 

Val Asn Asn Arg Gin Lys Asp Ala Ala Thr Ala Gin Thr Leu Gin Ala 

370 375 380 

Phe Leu His Trp Ala lie Thr Asp Gly Asa Lys Ala Ser Phe Leu Asr> 
385 390 J95 40Q 

Gin Val His Phe Gin Pro Leu Pro Pro Ala Val Val Lyc Leu Ser Asp 

405 410 415 

A.a Leu He .Ala Thr He Ser Ser Ala Glu Met Lvs Thr Asp A1a Ala 

" tZ0 425 430 

T::r Lou Ala Gin Glu Ala sly Asn Phe Glu Arg He Ser Gly Asd Leu 
4 3 5 440 44S 

-ye ~hr Gin He As? Gin Val Glu Ser Thr Ala Gly Ser Leu Gin Glv 
450 455 46Q 

Gin Trp Arg Gly Ala Ala Gly Thr Ala Ala Gin Ala Ala Val Val Arq 
4 6 5 47n , c 3 

* u 4/5 480 

Phe Gin Glu Ala Ala Asn Lys Gin Lys Gin Glu Leu Asd Glu He Ser 

485 43C 495 

Thr Asn He Arg Gin Ala Gly Val Gin Tyr Ser Arg Ala Asp Glu Glu 

505 Sl ° 

Gin GiP. Gin Ala Leu Ser Ser Gin Mer Glv Phe Val Pre Thr Thr Ala 

5-5 520 5 ;;5 

AlA ?ro Ser Thr Ala Ala Pro Pro Ala Pre Ala Thr Pro 

— - dJB 540 

A: * ?rr: ?ro ^~ Ala Ala AGr ' Thr ? — As ~ ^ 31;: Pro 

S n 5 - - '■ - ^ 

Pro Pro Val II- Ala Pr~. Asn Ala Pro sin Pro Val Arc: He Asd Asn 

-30 



Phe .^r Phe Ala ^ 



jdb 530 

ro Ala 31 v Tro Val Glu Ser 



■j .: , 



^ -i - 5 = : 

Tnr Asp 3e: Lys A .a Ala Ala Ar-; l.eu slv Se 

6 73 

T'.'r Me t Pro Tv - • ^ - ■ , - > . , . _ , ^. 



*3 L 



— a Ser Ala Gl : 
5 5 5 

:in Met Glv Gl: 



'a ^ 5 o : 
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?he Val Val Trr) Leu 
740 

Ala Lys Ala Leu Ala 
755 

Ala Pro Ala Pro Ala 
770 

Gly Glu Val Aid Pro 
785 

Pro Ala 



Gly Thr A^a Asn Asn 
745 

Glu Ser lie Arg Pro 

760 

Pro Ala Glu Pro Ala 
77 b 

Thr Pro Thr Thr Pro 
790 



Pro Val Asp Lys Gly Ala 
750 

Leu Val Ala Pro Pro Pro 
765 

Pro Ala Pro Ala Pro Ala 
780 

Thr Pro Gin Arg Thr Leu 
7 95 goo 



(2) INFORMATION FOR SEQ ID NO: 3 52: 

ii) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 
3TT1ANDEDNESS : single 

:0! TOPOLOGY : linear 

;ii; MOLECULE TYPE . Other 

(XI ; SEQUENCE DESCRIPTION: SEQ ID NO: 3 52: 

TCCAAAC CACCGAGCGG TTCOCCTGAA ACGG 

2 INFORMATION ?0H SEQ ID NO. 3 53: 

; i ; S E QUENCH CHARACTER I S T I CS : 
*A; LENGTH: 3 7 base pairs 
B) TYPE: nucleic acia 
' " : STRAND EDNES S : 3 i na 1 e 
'D* ""OPOLOGY- linear 
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GCCGCCGGGA CGGTCAACAT TGGGGCCT-r GACGCCTATC TGTCGGAAGG 7GATATGGCC 
GCGCACAAGG GGCTGATGAA CATC3C3CTA GC=ATCTCCG "TCAGCAGG? CAACTACAAC 
CTGCCCGGAG TGAGCGAGCA CCTCAAGCTG AACGGAAAAG TCCTGGCGGC CATGTACCAG 
GGCACCATGA AAACCTTGGGA CGACGCGCAG ATCGCTGCGC TCAACCCCGG CGTGAACCTG 
C=CSGCACC5 CCGTAGTTCC GCTGCACC3C TCCGACGGGT CCGGTGACAC CTTCTTGTTC 
ACCCAGTACC TGTCCAAGCA AGATCCCGAG GGCTGGGGCA AGTCGCCCGG CTTCGGCAC" 
ACCGTCGACT TCCCGGCGGT GCCGGGTGCG CTGGGTGAGA ACGGCAACGG CGGCATGGTG 
ACCGGTTGCG CCGAGACACC GGGCTGCGTG GCCTATATCG GCATCAGCTT CCTCGACCAG 
GCCAGTCAAO GGGGACTCGG CGAGGCCCAA CTAGGCAATA GCTCTGGCAA TTTCTTGTTG 
CCCGACGCGC AAAGCATTCA GGCCGCGGCG GCTGGCTTCG CATCGAAAAC CCCGGCGAAC 
CAGGCGATTT CGATGATCGA CGGGCGCGCC CCGGACGGCT ACCCGATCAT CAACTACGAG 
TACGCCATCG TCAACAACCG GCAAAAGGAC GCCGCCACCG CGCAGACCTT GCAGGCATT" 
^Z^l^ CGATCACCSA CGGCAACAAG GCCTCGTTCC TCGACCAGGT TCATTTCCAG 
CC3CTGCCGC CCGCGGTGGT 3AAGTTGTCT GACGCGTTGA TCGCGACGAT TTCCTCCGGA 



JZT^ C ^ C3C " '' AA '-' t ----' A! - i CCGGGCGATC CCAACGCAGC ACCTCCGCCC 



AA 



.GGCG CI 



12) NO • ibb : 



zq'cesc2 character 
length: .3^2 

TYTE: .immo ic — 
GTRATTEEEriE 



1200 
1260 

1380 
144 0 
150 0 



GCTGCCTG CTGGCTGGGT GGAGTCTGAC 
Jcgactacgg ttcagcactc ctcagcaaaa CCACCGGGGA CCCGCCATTT 
m^™^" GGCCGCC:GGT ggccaatsac ACCCGTATCG TGCTGGGCCG GGTAGACCAA 
AAGCTTTACG CCAGCGCGGA AGCCACCGA2 TCCAAGGCCG CGGCCCGGTT GGGCTCGGAC 156 Q 
l CZ 2lt TGC:: CTACCCGGG - ACCCGGATCA ACCAGGAAAC CGTCTCGCTC 162 0 
GGGTGTCTGG AAGCGCGTCG TATTACGAAG TCAAGTTCAG CGATCGGAGT 
GCCAGATCTG 3ACGGGCGTA ATCGGCTCGC CCGCGGCGAA CGCACCGGAC 
C.CAGCGCTG n.. GTGGTA TGGCTCGGGA 2CGCCAACAA CCCGGTGGAC 



16 8 0 
1740 
180 0 

^ _ TCGCC -- GCCGCCGGCG 186 0 

I~Tl~^ a- AGA GC::::GC ' r GCGG "GGCGC 2GGCCGGGGA AGTCGCTCC1 



1920 
196 2 
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100 105 no 



Ala Gin Gin Val Asa Tyr Asn Leu Pro Gly Val Ser 31 u h2s Leu Lvs 

il3 120 X n 5 

Leu Asn Gly Lys Val Leu Ala Ala Het Tyr Gin Gly Thr He Lys Thr 

^ 3 ^ 13 5 

Trp Asp Asp Pro Gin He Ala Ala Leu Asn Pro Gly Val Asn Leu Pro 

- 4S 150 1S5 160 



Gly Thr Ala Val Val Pro Leu His Arg Ser Asp Gly Ser Gly Asp Thr 
Phe Leu Phe Thr Gin Tyr Leu Ser Lys III Asp Pro Glu Gly Tr^ Gly 



1U > 190 
u Y s oer Pro Gly Phe Gly Thr Thr Val Asp Phe Pro Ala Val Pro Gl- 

. L f 5 200 205 

«-a _y 31u Asn Gly Asn Gly Gly Met Val Thr Gly Cvs Ala Glu 

- 15 220 
Thr Pro Gly Cys Val Ala Tyr He Gly He Ser Phe Leu Asn Gin Ala 

:; r . ..... :? 5 r - 35 :.c 

-•^^ . , y ^ u As „ 3(Jr .^^ 

243 250 ^55 

?r.e -ou .eu Pro Asp Ala Gin Ser He Gin Ala Ala Ala Ala Glv Phe 



2G 

Ser 



26b n 1( 



n- r_ ^a Asn G .n Ala lie Ser Me- lie Aso Glv P^~ 
= 280 ^ 85 
Ala Pro Ado Sir Tv- --^ , c _ ^ ■ ,, 

290 " AS " '~ " T -'' r Ala ^ Val Asr. 

LyC ^ iUa Thr A];i ^ Thr Leu Gin Ala Phe Leu 

310 315 l^r, 



:rp Ala He Tnr A^c Gly Asr. Lys Ala Ser Ph 



Leu Agtd Gin Val 

Hi.; ?h- v- a _ , w .. ., . 335 

'~ " r= 7a - '•'/-> Leu Ser Aed Ala L---U 

.... ... _^ J ™ ^ , 34 5 3 50 

; r 'I ~~~ ~ er 3 - r 3 * v 31v 31 -' 3«r Gly ily Ser 31 v 31 v 

r •-. , ■ -.. ... "i 60 

Alrf : ' rT ' ~ r ° ~* ? « Pro Va: A],' ore Pro Pro Pro 

-■ . - _ u i9 ° 3 " , 00 

°*** ■' * .-' ^ - ' y " Acn Ala ,\1j 
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S« Oly Ser Ala Ser Tyr Tyr Glu Val Lys Phe Ser Asp Pro Ser Lys 
550 555 5 6 0 

Pro Asn Gly Gin lie Trp Thr Gly Val He Gly Ser Pro Ala Ala Asn 

5 o 5 5 70 

Ala Pro Asp Ala Gly Pro Pro Gin Arg Trp Phe Val Val Trp HI Gly 

580 585 590 

Thr Ala Asn A= n Pro Val Asp Lys Gly Ala Aia Lys Ala Leu Ala Glu 

600 605 

Ser lie Arg Pro Leu Val Ala Pro Pro Pro Ala Pro Ala Pro Ala Pro 

Ala Glu Pro Ala Pro Ala Pro Ala Pro Ala Gly "u Val Ala Pro Thx 

Pro Thr Thr Pro Thr Pro Gin Arg Thr Leu Pro Ala ^ 
645 550 
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CLAIMS 

1. A polypeptide comprising an immunogenic portion of a soluble 
M tuberculosis antigen, or a variant of said antigen that differs only w conservative 
substitutions and/or modifications, wherein said antigen has an N-tcrminal. sequence .elected 
from the group consisting of: 

(a) Asp-Pro- Val-Asp-AIa-Vd-Ile-A^ 

VaJ-Val-Ala-Ala-Leu: (SEQ ID No. 120) 

fb) .^a-Val-GIu-Ser-Gly-Met-[.eu-AIa-Leu-G]y-Thr-Pro-AIa-Pro-Se 

(SEQ ID No. 121) 

(O Ala-Aia-Met-Lvs-Pro^ 

Lys-Glu-Gly-Arg; (SEQ ID No. 122) 
(d) Tyr-T>T-Trp-C^s-Pro-GIy-^^ 
(SEQ ID No. 123) 

(e> .Vsp-Ile-GIy-Ser-GIu-Ser-Thr-Glu-Asp-GIn-GIn-Xaa-Ala-Val; (SEQ 
ID No. 124) 

if) AIa-Giu-Giu-Ser-IIe-Ser-Thr-Xaa-(]Iu-Xaa-Ilc-Vai-Pr t ): <SEQ ID No. 
125) 

(gl Asp-Pro-Glu-Pri^Ala-I^ 

Ser: ,SEQ ID No. ;2t>> 
fhj Ala-Pro- [Av-'ITir-r;^ 
(SEQ !D \u 12"» 

m AM>Pro--\i j-Ser .\ia-Pro~\sp-\ al-Pro- i :ir-\la-Ala-Gin- i .eu- fhr-Ser- 
;.ju-:.ju \,i>^cr-i — \ .--[>;, -.-\-n-\ ai-Ser-f n ]c- -^a- \:,:\. .\SEvJ 
ID No ::S)jnd 

U ' Ala-Pro-Giu- Ser < ii> -Aia-i -L.-u-GK -< y l} -Thr-VaU ih- Aia-uiv. 
(SEQ ID \o !>m 
'■■Giere:r. \aa may ~e am i;r,w;< 
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substitutions and/or modifications, wherein sad antigen has an N-terminaJ sequence selected 
from the group consisting of: 

(a) Asp-Pro-Pro-Asp-Pro-His-Gln-Xaa-Asp-Mct-Thr-Lys-GIy-Tyr-Tyr- 
Pro-Gly-Gly-Arg-Aig-Xaa-Phe; (SEQ ID No. 129) and 

(b) Xaa-T>r-Ile-Ala-Tyr-Xaa-Tlir-Thr-Ala-Gly-Ile-Val-Pro-Gly-Lys-Ile- 
Asn-Vai-His-Leu-Val: (SEQ ID No. 137). wherein Xaa may be any 
amino acid. 

3. A polypeptide comprising an immunogenic portion of a soluble 
1/ tuberculosis anngen. or a variant of .aid antigen that differ, only :n conscrvat.ve 
substitutions and/or modificauons. wherein said antigen composes an ammo ae,d sequence 
encoded by a DNA sequence selected from the group consisting of the sequences recited in 
SEQ ID Nos, I. 2. 4-10. 13-25. 52, 99 and 101. the complements of said sequences, and 
DNA sequences that hybridize to a sequence recited in SEQ ID Nos.: 1. 2. 4-10. 13-25. 52 
09 and 101 or a complement thereof under moderately stringent conditions. 

4 A poiypentide comprising an immunogenic portion of a 
U ' Wc,W antI ? en - or J v: ™ " said antigen dial differs only in conservative 
substitutions and-or modifications, wherein said antigen comprises an amino acia sequence 
encoded by :, DNA sequence selected from tnc group consisting at tnc sequences recited in 
SEQ ID Nos, 26-51. 138. i :«> ; (V ;.;, S ,, :o; :4()> :42 -:47. 253-250. 295-298. 309. 3lo. 

31S-320. 322. ^-i — r,,,,,, - . • . ; 

" an J -i. .tie complements ol said sequences. 

and DNA -.cuuences :nat hvbndi/e to i sequence recited in SIT) !D Nos . 2^;. '.3X. ]}» 

• (v _ w S _ lM w ; , D> ; ] S ^ :() _ ; ^ 

" ^ ' ' * and Ul or 1 C0in ^ ,emenl «w™r' :macr moderately stringent conditions 



A DNA molecule comnnsniL: a nucleotide seuuencL- 



jncociim 7 a 
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"7. A host ceil transformed with 



an expression vector according to claim 6. 



8- The host cell of claim 7 wherein the host cell is selected from the group 
consisting of £. coli y yeast and mammalian cells. 

9- A pharmaceutical composition comprising one or more polypeptides 
according to any one of claims 1-4 and a physiologically acceptable earner. 

10. A pharmaceutical composition comprising one or more DNa 

molecules acenrriind tn r-1-.irr, < .. _i -_t ., 

--, ... _ mvi a uuvaiuiuiijcany acccptaDIc earner. 

11. A pharmaceutical composition compns.ng one or more DNA 
sequences recited in SFQ ID No, 3. II. 12, !40 . 141. 156-160. 189-193. 199, 200. 203, 
215-125. 2" 23«. 261-2-6. 292. 293. 303-308. 310-315. 317. 321. 323. 325-327. 330-332.' 
33 -- 336 - 338 - 340 wd -' 4 -- ?47 - «d a physiologically acceptable earner. 



1-. A vaccine comprising one or more polypeptides according to anv 
01 ' daims Mand a non-spcenk immune response enhancer. 



one 



■ ' \ vaccine eomnrisnm 

U p0i> ' pCptidc hav,n « M N-»emunai sequence .elected from the group 
consisting of sequences recited in SEQ ID NO: p. 4 and !35. and 

J non-specific immune : -~.nor.se ennxneer 

'■ '• accir.e eorr.rriMni: 

one or more Doivreouiies •••>..« i ^ . \ ■ - 

. . i-^uucs ..^oj^j a i)NA .sequence selected ;rom tne 

:roup consisting of SFO !D Nos.. ;, ;;. ; : . ; ,,, ;,■ ;Sw .. (;; , uq , ()() , () , 

::5. :-o. 26: 202. 20;. , r ..; 0S . v- v , ? , 5 3r ; , ; "... 

''" S " ' 4 " ^ U; " U " rhc «-™wi«nem.v of said sequences, and DNA sentiences that 
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a non-specific immune response enhancer. 

15. The vaccine of claims 12-14 wherein the non-specific immune 
response enhancer is an adjuvant. 

1 6. A vaccine comprising one or more DNA molecules according to claim 
5 and a non-specific immune response enhancer. 

1 / . A vaccine comprising one or more DNA sequences recited m SEQ ID 
Nos.: 3. 11. 12, 140, 141. 156-160 1 QQ ^nn ~>n? ^is ^ n 

292. :<•:, 303-308. 3I0-1I*. 3,7. 3::. 3:3. 325-3:". .30-332. 334. 336. 338. 340 and 342- 

34"; and a non-specific immune response enhancer. 

18. The vaccine of claims 16 or 1 7 wherein the non-specific immune 
response enhancer is an adjuvant. 

19. A method for inducing protective immunity m a patient, comprising 
administering :o a patient a pharmaceutical composition according ro any one of claims Q-il. 

20. A method for inducing protective immunity m a patient, comprising 
administering to a patient a vaccine according to any one of claims 12-18. 

-!■ A fusion protein comprismr two o r more polypeptides according :o 
arw . me or claims \ -A 

- X :mum - n ^m ->mrrsinr. ->e a puue poi> peptides according :o 
any one or claims 1-4 and FSAT-^r 

A Iasu,n rmotem comprising >me or more polypeptides according *o 
any one ofoiaims 1 -4 and :he 'A tuh.-^utmiv iptw" • y - > oao > -n v "o ■ ■ ^ 



24. A pharmaceuticai composition compnsing a fusion protein aecordini: 
to any one of claims 21-23 and a physiologically acceptable earner. 

25. A vaccine comprising a fusion protein according to any one of claims 
21-23 and a non-specific immune response enhancer. 

26. The vaccine of claim 25 wherein the non-specific immune response 
enhancer is an adjuvant. 

2/. A method for ukVulu punecuve immunity in a patient, comprising 
administering to a patient a pharmaceutical composition according to claim 24. 



28. A method for inducing protective immunity m a patient, comprising 
administering to a patient a vaccine according to claims 25 or 26. 



29 A method for detecting rubercuiosis in a patient, compnsing: 

(a) contacting dermal cells of a patient with one or more polypeptides 
according to any one of claims 1 -4; and 

(b) detecting an immune response on the patient's skin and therefrom 
detecting tuberculosis in the patient. 

30 A method for detecting tuberculosis m a patient, compnsing: 

g dermai ce:N or i panent with polypeptide ria\ 
ti *ue group consistinu of sentiences recited in SFu ip \M 



' a! contacting dermai eeils of i patient with a polypeptide havmu an \- 
terminai sequence selected fnn 



^ ana . ano 

:bi detecting an immune response on the patient >km and iherefr 
detecting tuberculosis in the patient. 

A i^etr.od tut deieetmg tuberculosis m a patient, comnnsinn: 
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310-315, 317. 321. 323, 325-327, 330-332, 334, 336, 338, 340 and 342-347, the complements 
of said sequences, and DNA sequences that hybridize to a sequence recited in SEQ ID Nos.: 
3, 11, 12, 140, 141, 156-160, 189-193, 199, 200, 203, 215-225, 237, 239, 261-276, 292, 293, 
303-308, 310-315, 317, 321, 323, 325-327, 330-332, 334, 336, 338, 340 and 342-347; and 

(b) detecting an immune response on the patient's skin and therefrom 
detecting tuberculosis in the patient. 



is indurauon. 



(a) 
(b) 



The method of any one of claims 29-3 1 wherein the immune response 



A diagnostic kit comprising: 

a polypeptide according to any one of claims 1-4; and 

apparatus sufficient to contact said polypeptide with the dermal cells of 



a patient. 



34. A diagnostic kit comprising: 

(a) a polypeptide having an N -terminal sequence selected from the group 
consisting of sequences recited in SEQ ID NO: 134 and 13 5; and 

(b) apparatus sufficient to contact said polypeptide with the dermal cells of 

a natient. 



35. A diagnostic kit comprising: 

on ,i polypeptide enc^iied a DNA seuuence -.elected from tne "roup 
jonsisung of SEQ !D Nos.: 3. A. 12. 140. ]4i. i^o-loi). 1S 1 >-193. I t)C >. 200. 203. 21 5-225. 
25 A 22 { >. 2oi-27o. 292. 293. '03-508. .3 1 tj-.5 1 51 A 521. 525. 525-52 A 3.30-332. ",4. 3 36. 
33 8. ; 40 and 342-^4". the complements of said sequences, and DNA sequences that 
hvnndize to a sequence recited in SEQ ID Nos.. 3. 11. 12. 140. 141, 156-160. 189-193. ] l >9. 
200. 205. 215-225. 25A 259. 2d i -276. 2 l >2. 2 ( >5. 503-^08. 510-.A>, 5T. 521. 525, 525-52A 
330-3 ;2. -4. 33 0< 33^ 340 and 342-54-; and 
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36. A diagnostic kit comprising: 

(a) a fusion protein according to any one of claims 21-23; and 

(b) apparatus sufficient to contact said fusion protein with the dermal ceils of a 
patient. 

37. A fusion protein according to claim 23 comprising an amino acid 
equence selected from the group consisting of sequences recited in SEQ ID NO: 153. 209, 
51 and 355. 



"76 



D7 T Cell Proliferation 



a 
u 
c 



c 



900Q 



•TOO 



4000 



2DC0 - 




■ 



D7 TFS* 



Q-i 



It :i 

v 



m a 

2 20kd 
! Z 25k£ 



W O 9 l > ■'42070 



14 



VC I 03268 



D160 T Cell Proliferation 



IOOOO- 




DI60 TFSs 



1-5- 



3_j - 



ISIS.. 

if 



■ 0 

a cs? 

a I4kd 
3 20kd 
"J 26kd 



WO 42<n, 



VCT 1'SW(I326N 




WO 99/42076 



r( i i. ; sw<u:()X 




W O 94 '42076 



I'C I l 032(>S 




WO 99/42tr6 



P( T l N<>9 03208 



T Cell Clone PPD 800-10 IFNg Production 

2J3i 



1^ 

! 

ixz 



* 

O 



1.5- 




Medium 



CSP 



sor 



WO 



VC'Y/i N9 l > .-03268 



D131 T Cell Proliferation 



sooo 




5 2000" 
m i 




20 5^ 2 .S3 0 

Antigen igfmL 

D131 IFNg 

3-0- 



\ 




WO 99/42076 



N 1 I 



P( T I N99 0326S 



D184 T Cell Proliferation 



© 



L0080-T 



S 8000- 



z sooo 



- 4000" 



2000 




Antigen 

D184 IFNg 



3.0- 



2.0- 



3 " 



~"~ TbHS-38.: 
-•— TbE9 
— 38.1 



L0- 



0-5 



WO l )9 12(l"6 



PC I I S^9 



D201 T CeU Proliferation 




Anrigeii igrml 

D201 IFNg 



l£2 



TbH9 



WO W42(P<) 



1(1 l-t 



PC "I I VJ9 0J2(>S 



<u L3CO- 



3 
3* 



580-+ 





Anriyen: 



J-5 - 



FIGS X4-R 



WO W/42076 



1-1 



I'C I TS99/0326S 



DlfliTCaEPraiifffrartnn 



2 *ooo-i 




•v.*. vw. 
Vav.v. 



2 



5 



3-5- 



IP 




o 



/7GS 1 . 9.4-5 



WO 99 42(P6 



1-1 



PC T 1 0326S 



Tuberculosis: Protection of Cyncmcigus Monkeys 
with Recombinant Antigens of Mtb 
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